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FOREWORD 


On  a continuing  basis,  the  Army  Mathematics  Steering  Committee  (AMSC) 
sponsors  three  annual  conferences.  These  meetings,  in  the  areas  of  applied 
mathematics,  numerical  analysis  and  statistics,  are  designed  to  promote 
better  communications  among  Army  scientists.  The  oldest  member  of  this 
series,  the  Conference  of  Army  Mathematicians,  held  its  twenty-second 
meeting  at  the  Benet  Weapons  Laboratories,  US  Army  Watervliet  Arsenal, 
Watervliet,  New  York,  on  13-14  May  1976.  Dr.  Moayyed  A.  Hussain,  the 
Chairman  on  Local  Arrangements,  took  this  assignment  seriously,  and  he, 
together  with  other  members  at  Watervliet  Arsenal,  are  due  the  thanks  of 
all  the  attendees  for  an  exceptionally  well-planned  meeting. 

The  ninth  Conference  of  Army  Mathematicians  also  had  as  its  host 
Watervliet  Arsenal.  Statistics  from  these  two  meetings  point  out  some 
of  the  changes  taking  place  in  these  affairs.  The  ninth  Conference  had 
65  attendees,  while  the  present  meeting  entertained  94  persons.  The  1963 
meeting  had  one  invited  speaker  and  24  contributed  papers,  while  the  1976 
Conference  had  6 invited  speakers  and  44  contributed  papers.  The  most 
encouraging  statistic  in  these  figures  is  the  increase  in  the  number  of 
contributed  articles.  While  5 of  the  44  papers  in  this  class  were  given 
by  University  professors,  this  still  leaves  a sizable  increase  in  the 
number  of  scientific  papers  being  presented  by  Army  scientists. 

The  Subcommittee  on  Applied  Mathematics  of  the  AMSC  has  charge  of  the 
planning  of  the  Conference  of  Army  Mathematicians.  It  selects  invited 
speakers  whose  fields  stress  areas  of  applications  of  mathematics  which 
meet  the  needs  of  the  Army.  It  also  selects  some  speakers  that  address 
fields  which  meet  the  special  interests  of  the  host  installation.  From 
the  titles  of  the  addresses  of  the  invited  speakers  listed  below,  one  may 
note  that  the  requirements  of  the  host  in  the  area  of  fracture  mechanics 
is  stressed  in  several  of  these  talks. 

Nonlocal  Elasticity  and  Fracture  Mechanics 

Professor  A.  C.  Eringen,  Princeton  Universi 
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Unsteady  Problems  in  Combustion  Using  Activation  Energy  Asymptotic 
Professor  John  Buckmaster,  University  of  Illinois 

A Return  to  Input-Output  Methods  in  Statistical  Theory 
Professor  Thomas  Kailath,  Stanford  University 

Three-Dimensional  Cracks  and  Weight  Functions 

Dr.  Hans  S.  Bueckner,  General  Electric  Company 

Recent  Developments  in  the  Theory  of  Elasticity  and  Rupture  of 

Fluid  Infiltrated  Solids 

Professor  James  Rice,  Brown  University 

In  addition  to  the  above  speakers.  Professor  George  H.  Handelman 
of  Rensselaer  Polytechnic  Institute  gave  an  invited  address  at  the 
banquet  which  was  held  on  the  first  evening  of  the  Conference. 

Members  of  the  AMSC  were  pleased  that  representatives  of  the  Air 
Force,  the  Navy,  and  the  Department  of  National  Defence  of  Canada  were 
in  attendance  at  this  symposium.  They  were  also  pleased  to  note  the 
host  installation  had  22  of  their  staff  members  listening  to  the  pre- 
sented papers. 

The  last  two  articles  appearing  in  these  Transactions  were  not  given 
at  the  Conference  of  Army  Mathematicians.  These  papers,  one  by  Dr.  Achi 
Brandt  and  the  other  by  Professor  Gene  H.  Golub,  resulted  from  invited 
addresses  delivered  at  the  1976  Army  Numerical  Analysis  and  Computers 
Conference  held  11-12  February  1976  at  the  US  Army  Research  Office. 
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STATE  OF  STRESS  IN  THE  NEIGHBORHOOD 


OF  A SHARP  CRACK  TIP* 

A.  Cemal  Er ingen 
Princeton  University 


ABSTRACT 

Field  equations  of  nonlocal  elasticity  are  solved  to  determine  the 
state  of  stress  in  the  neighborhood  of  a line  crack  in  an  elastic  plate 
subject  to  uniform  tension  perpendicular  to  the  line  of  crack  at  infinity. 

It  is  found  that  no  stress  singularity  is  present  at  the  crack  tip.  When 
the  maximum  hoop  stress  is  equated  to  the  cohesive  stress  Griffith  criterion 
of  fracture  is  obtained  with  the  Griffith  constant  fully  determined. 

Cohesive  stress  necessary  to  break  the  atomic  bonds  are  calculated  for 
A£ , Ni,  Fe,  LiF,  Diamond  and  Zn.  The  results  are  in  excellent  agreement 
with  those  known  in  the  atomic  theory  of  lattices  and  experiments. 

1 • INTRODUCTION 

The  determination  of  the  state  of  stress  near  the  tip  of  a sharp  crack 
in  an  elastic  plate  subject  to  uniform  tension  perpendicular  to  the  line 
of  crack  at  infinity.  Fig.  1,  is  one  of  the  most  fundamental  problems  in 
fracture  mechanics.  The  solution  of  this  problem  was  first  given  by  Inglis 
[1913]  and  it  was  used  by  Griffith  [ 1 9 2 0]  to  establish  his  celebrated 
criterion  for  fracture  of  solids.  The  classical  elasticity  solution  of 
this  problem  gives  a hoop  stress  with  a / r singularity  near  the  crack  tip, 
where  r is  the  distance  from  the  crack  tip.  Thus,  according  to  classical 
elasticity  the  stress  is  infinite  at  the  crack  tip  for  even  a minute 
amount  of  applied  tension.  Since  a plate  with  a sharp  crack  possesses  a 
certain  amount  of  resistance  to  fracture  until  the  applied  tension,  tp, 
reaches  a critical  value  determined  by  the  so-called  Griffith  criterion 

(1.1)  t H = c_ 

O C» 

where  t is  the  half  crack  length  and  C is  an  experimental  constant  (Griffith 
constant),  it  must  be  concluded  that  classical  elasticity  solution  fails 
to  apply  near  the  crack  tin.  This  conclusion  is  responsible  for  the  abandon- 
ment of  maximum  stress  hypothesis  for  failure  which  has  been  prominent  in 
structural  mechanics.  Consequently,  for  brittle  solids,  since  the  time  of 
Griffith,  two  distinct  fracture  criteria  have  been  in  use,  one  for 
structural  members  with  no  cracks  and  one  for  those  containing  cracks.  In 
fact, the  state  of  the  art  is  more  involved,  far  beyond  this  dichotomy,  and 
many  other  fracture  criteria  have  been  introduced  by  other  authors  to  ovei- 

k 
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come  this  stress  singularity  (e.g.,  J-integral,  Barenblatt  theory  [1962] 
Khristianowi ch  [1955]  -Dugdale  theory  [i960],  Goodier  & Kanninen  locally 
nonlinear  theory  [1966],  etc.)*  Below  we  give  a brief  discussion  of 
these  theories.  A thorough  discussion  of  the  status  of  the  art  is  to  be 
found  in  Goodier's  [1968]  article. 

Griffith  Criterion.  Griffith  assumes  that  the  work  done  to  extend  a 
line  crack  of  length  21  an  amount  of  2df  must  be  equal  to  the  work  of 
the  surface  tension.  In  this  way  he  arrived  at  the  formula  (1.1)  with 


(1.2) 


2E 

n ( 1 -V  ) Y 


where  E is  the  Young's  modulus,  v is  the  Poisson's  ratio  for  an  isotropic 
elastic  plate  and  y is  the  surface  tension  energy.  The  surface  tension 
energy  y he  employed  is  that  borrowed  from  fluid  statics.  In  obtaining 
(1.1),  the  Inglis'  solution  for  the  elliptic  hole  was  used  with  the 
provision  that  in  the  limit  the  minor  axis  of  the  ellipse  approach  to 
zero.  This  theory  has  been  under  criticism  for  over  half  a century 
nevertheless  surviving  all  criticisms.  Basic  complaints  may  be  sum- 
marized as: 

(i)  Crack  tip  stress  is  infinite  no  matter  how  small  the  applied  load  is. 

(ii)  The  crack  opens  up  into  an  ellipse,  so  that  the  shear  strain  at  I 
the  tip  is  too  large  (tt/4)  for  the  linear  theory  to  be  applicable. 

(iii)  Ellipse  shrinking  to  a crack  may  not  be  "uniform,"  mathematically, 
i.e.,  other  shapes  may  give  different  limits. 

(iv)  The  surface  tension  energy  y borrowed  from  fluid  statics  may 
not  be  appropriate  for  solids.  • 

Barenblatt  Model.  To  overcome  the  objections  (i)  and  (ii)  Biarenblatt  [1962] 
assumed  that  the  tip  region  of  the  crack  is  not  free  of  tractions  but  there 
exists  a "cohesive  stress,"  a(x),  distributed  in  such  a way  as  to  bring  the 
crack  tip  to  a cusp.  Fig.  2.  He  then  determined  the  shape  of  o (x)  to 
achieve  the  cusp  form. 

Khristianowich-Dugdale  Model . Khristianovich  [1955]  and  Dugdale  [1966] 
assumed  that  beyond  the  crack  tip  over  a small  length  s there  is  a 
constant  cohesive  stress  distribution  to  close  up  ends  of  the  crack, 

Fig.  3. 


Clearly  both  Barenblatt  and  Khristianwich-Dugdale  theories  are 
objectionable  for  their  uses  of  heuristic  assumptions  not  justifiable  on 
the  basis  of  any  physical  principles  or  experimental  work. 

Good  i er-Kann  i non  Model  . According  to  Goodier  and  Kanninen  [1966]  the 
atomic  interactions  arc.  important  at  the  tip  of  a crack.  In  order  to 
overcome  the.  objection  (iii)  they  use  nonlinear  springs  along  the  tip 
of  the  crack.  The  extent  of  nonlinear  springs  and  their  properties  are 
left  to  our  discretion.  While  the  basic  idea  of  inclusion  of  long  range 
interatomic  interactions  are  worthy  of  careful  attention  the  model  contains 
arbitrary  factors  and  functions  to  be  fixed  to  suit  the  purpose. 
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Remarkably  common  to  all  these  models  is  the  unequivocable  realiza- 
tion that  near  the  crack  tip  interatomic  cohesive  forces  must  he 
important. 

There  exist  solutions  of  Inglis'  problem  by  using  polar  theories, 
e.g.,  couple  stress  theory  (Sternberg  and  Muki  [1967]),  micropolar 
theory  (Kim  and  Eringen  [1.973]).  These  results  also  contain  the  same 
type  of  singularities  and  therefore  no  further  progress  is  possible  on 
these  grounds. 

Recently  we  have  developed  a continuum  theory  that  takes  into 
account  the  effect  of  long  range  interatomic  attractions.  According  to 
this  theory  the  stress  at  a point  of  an  elastic  solid  is  influenced  by 
the  strains  at  all  points  of  the  body.  All  known  physical  and  thermo- 
dynamic principles  were  satisfied  (of.,  Eringen [ 1972a ,b] , Eringen  & 
Edelen  [1972]).  When  the  nonlocal  theory  is  employed  for  the  solution 
°f  the  crack  tip  problem  one  finds  that  the  stress  field  at  the  crack 
tip  is  no  longer  singular  and  therefore  it  is  possible  to  revert  back 
to  the  maximum  stress  hypothesis  for  fracture  criterion.  Remarkably 
enough  this  theory  not  only  gives  Griffith's  criterion  without  any  new 
assumption  but  also  determines  the  Griffith  constant.  In  fact  the 
cohesive  stress  calculated  for  various  materials  are  in  excellent 
agreement  with  those  known  from  the  atomic  theory  of  lattices  and 
experiments.  The  main  purpose  of  the  present  paper  is  an  exposition  of 
these  results. 


2.  BASIC  EQUATIONS  OF  NONLOCAI,  EEASTTC1TY 


Basic  equations  of  linear,  homogeneous , isotropic,  nonlocal  elastic 
solids  with  vanishing  body  and  inertia  forces  are  (cf.,  Eringen  [1972b]) 


(2.1) 

tk£,k 

= 0 

(2.2) 

tk £ = 

[A'dx'-xb  o.M(x')  &u  + 2p ' ( | x ' -x  | ) ekf(x')]  dv(x’) 

1/ 

(2.3) 

ek£  = 

h(uk,t  + u£,k} 

where  the  only  difference  from  classical  elasticity  is  in  the  stress 
constitutive  equations  (2.2)  which  states  that  the  stress  t^  «(x)  at  a 
point  x depends  on  strains,  e^o(x'),  at  all  points  of  the  body.  For 
homogeneous  and  isotropic  solids  the  material  moduli  X'(|x'-x|)  and 
p’(|x'-x|)  are  functions  of  the  distance  between  the  points  x'  and  x. 
The  integral  in  (2.2)  is  over  the  volume  F of  the  body  enclosed  within 
the  surface  9 \J . 

Here  and  throughout  we  employ  Cartesian  tensois  with  repeated 
indices  that  indicate,  summation  over  the  range  (1,2,3)  and  indicts 
following  a comma  partial  differentiation,  with  respect  to  space 
coordinates,  e.g. 

uk,£  5 3uk/:;xe 
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r 


In  our  previous  work  f 1 9 72b , 1974]  we  have  obtained  t ho  form  of  A'(|x'-x|) 
and  p'(|x'-x|)  for  which  the  dispersion  curves  of  plane  waves  coincide 
with  those  obtained  in  Born-von  Karm.in  theory  of  lattice  dynamics  within 
the  entire  Brillouin  zone.  Accordingly 


(*',  h')=  (A,u)  a ( | x 1 -x | ) 


(2.4) 


a ( | x ' -x  | ) = 


a0(a  “ 


x -x  < a 


I x -x  > a 


where  a is  the  lattice  parameter  A and  p classical  Lamd  constants  and 
aQ  is  a normalization  constant  to  be  determined  from 


(2.5) 


a(  | x'-x | ) dv(x ' ) = 1 


V 


Since  the  nonlocal  effects  are  most  important  along  the  edge  of  the  crack 
we  use  (2.4)  and  (2.5)  at  x,(  = 0 to  determine  a . This  gives  a = 6/ra^. 
Upon  carrying  (2.4)  into  (2.2)  we  will  have 


(2.6) 


Lkf 


ot ( I x ' —x I ) o^x')  dv(x') 


. ; 


whe  re 

(2.7)  Oj ^(x')  2 X e^(x')  6^  4-  2p  ekf(x') 

is  the  classical  Hooke's  law. 

Substituting  (2.6)  into  (2.1)  and  using  the  identity 
a(|-'"-!),k  = -cx  ( [ x ’ -x  | ) k,  ow,(x') 


ti 

n 


(a  °kl) ,k'  + a ak£,k' 


and  Green-Gauss  theorem  we  obtain 


(2.8) 


- | n(|x'-x|) 

aV 


ok^,(x')  dak(x')  + 


1/ 


a ( 1 x * -x | ) 0^  k.(?')  dv(x')  = 0 


Here  the  surface  integral  may  be  dropped  if  the  effect  of  the  surface 
tensions  are  negligible  or  the  body  extends  to  infinity  in  all  directions. 
We  assume  this  is  the  case  so  that 


(2.9) 


a ( | x ' -x  | ) ak<)  k,(x’)  dv(x') 


0 


It  is  not  difficult  to  prove  that  if  a(|x'-xj)  has  a bounded  support  and 
0kp  i.  is  continuous  in  V tlien  the  necessary  and  sufficient  condition  for 
(2.9;  to  be  satisfied  is,  cf . , Eringen  [1976] 


I 


(2.10) 


V,k  = ° 


Equation  (2.10)  together  with  (2.7)  are  none  other  than  Navier's  equation 
for  the  displacement  field  u(x).  From  this  result  it  follows  that 

Theovem.  The  displacement  field  of  the  nonlocal  elasticity  (under-  the 
conditions  stated  above)  satisfy  Navier’s  equation . 

For  the  displacement  boundary-value  problem  (1st  boundary-value  problem) 
this  implies  that: 

Corollary.  The  displacement  field  of  the  first  boundary  value  problem  of 
the  nonlocal  elasticity  is  identical  to  that  of  the  classical  elasticity . 

Note,  however,  that  to  obtain  the  stress  field  we  must  substitute  a^j, 
obtained  from  the  classical  theory  into  (2.6)  and  carry  out  the  volume, 
integration.  Thus,  for  boundary  conditions  on  the  tractions  we  must  employ 

(2.11)  t^n^.  = on  3l/t 

on  that  part  of  the  surface  3 l’t  where  the  traction  t^  is  prescribed. 


з . CRACK  PROBLEM 

Consider  a plate  weakened  by  a sharp  line  crack  of  length  2 t.  The 
plate  is  subject  to  a uniform  compression  tQ  at  the  crack  surface  and 
free  of  tractionsat  infinity.  The  displacement  field  tij  = u(x,y), 

и. ,  = v(x,y)  in  the  upper  half  plane  y > 0 are  given  by  the  classical 
elasticity  solution  (cf.,  Sneddon  [ 195 L , p.  A 0 A ] ) . 


(3.1) 


co 


_oo 


A(k)  + ( | k | y 


Try1)  B(k)  exp  (-  | k | y 


ikx)  dk 


+ y B(k) 


] 


exp(-|k|y  - ikx)  dk 


where  A(k)  and  B(k)  are  two  functions  to  be  determined  from  the  boundary 
conditions  at  y = 0.  These,  conditions  are: 


(3.2)  t = -t  , y = 0 

yy  o 

v = 0 , y = 0 

To  obtain  the  solution  of  the  crack  problem  with  crack  surface  free  of 
tractions  and  the  plate  is  subject  to  a uniform  tension  tyy  = t0  at  y = 00 
(Fig.  1)  to  Lhe  solution  of  the  above  problem  we  superimpose  a uniform 
stress  field  t = t . 

yy  o 

Substituting  (3.1)  into  (2.3)  and  (2.7)  we  calculate: 


|x|  < ^ 

W ±1 
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°yy  (x'  »y ' ) = - /27  } [lkl  A00  - (x^;  - !kly')  B(k>]  exp(-|k|y' 


(3.3) 


- ikx')  dk 

OO 

0yx(x,*y,)  = - 7§j  [k  A(k)  + (k  y'  ' "IT  Tfif ) I5(k)]  exP C- | k | y * 


ikx')  dk 


According  to  (2.6)  then  we  have 


cyy(x,y) 


a(|x'-x|)  ayy(x',y')  dx'  dy' 


(3. A) 


t (x,y)  = a(|x'-x|)  a (x',y')  dx'  dy' 

j yx 


Substituting  (3.3)  and  (2.4)  into  (3.4)  and  after  carrying  out  integrations 
on  x'  and  y',  we  set  y = 0 in  these  equations  and  in  (3.1)2  to  form  the 
boundary  conditions  (3.2).  As  in  classical  treatment  (3.2)^,  can  be  used 
to  determine  B(k)  in  terms  of  A(k) . The  process  is  lengthy  and  tedious. 

We  only  give  the  resulting  expression 


(3.5)  B(k)  = k 


k[fe  k2a2  + -•)  cos(ka)  + -j  ka  sin(ka) 

+ -j  k3a3  Si(ka)  - jr  ir  k3a3  - A(k)/J(-|-  k2; 

-l_  _ .J_  i-2a2  _ T 4.  k^a1*^  cos  (kil 

3 Hu  10  k 5 20  k / ^ ; 


3 X+p  10 


5 20 


4 (-3  ka  T+^L  - To  ka  + To  k3a3)sin(ka)  + T k3a3  W? 

+ A-k^)si(ka)  -I,k3a3^-i>g|i 

- To  17  k5a5  +f] 


where  Si(z)  is  thc^sine  integral  defined  by 

S i ( x. ) - f -Rla-t  dt 


With  B(k)  given  by  (3.5),  the  boundary  condition  (3.2)  is  satisfied  and 

/o  ON  1 / n o \ 1 1 . -L 


(3.2)^  and  (3.2)^  lead  to 


6 


A(k)  cos(kx)  dk  = 0 


x > l 


(3.6) 


( 2 /it  ) 2 


(2/tt)  • k «(ka)  A(k)  cos(kx)  dk  = T 


x < l 


where 


T : t (A+2p)/2u(A+p) 

o o 


a (ka) 


tt  { + 3)  cos(ka)  + ~ ka  sin(ka) 

+ j k3a3  Si  (ka)  - |tt  k3a3  - | ] 2/ [(|-  kSa5 

+ A k3„3  _ _L  k5„5  i±i  _ A k3„3  ±tH_ 

3 K 10  A+2)j  5 K A+2p 


+ JL  k7  7 ±±H 
20  K a A+2i 


rSir) cos(ka)  + (3  k"a"  ' is  k"°1‘  Tvk) 


+ 7o  k‘aC'  TTSr)  '’T»<ka>  + (3  k6a6  + Jo  k8a“  w,:) Si<ta) 


- -tt  kGa6  - - k3a3  - ~tt  k8a8  — + - k3a3  E-l  l 

6 k 3 k a A0  k A+2p  5 k A+2pJ  I 


The  dual  integral  equations  (3.6)  must  he  solved  to  determine  A(k) . 

When  this  is  done,  we  will  have  the  problem  solved. 

It  is  interesting  to  note  that  in  the  continuum  limit  a—>-0 
a — > 1 and  (3.6)  revert  the  dual  integral  equations  obtained  in  classical 
elasticity  for  the  same  problem.  With  a complicated  kernel  function  cf(ka) 
the  solution  of  (3.6)  cannot  be  affected  in  closed  form.  However,  we 
can  take  advantage  of  the  known  classical  solution  to  reduce  the  problem 
to  a Fredholm  integral  equation  which  is  more  amenable  to  numerical 
treatment.  To  this  end  let  A (k)  denote  the  solution  of  the  dual  integral 
equations  of  the  classical  theory 


(2/tt)  2 A (k)  cos(kx)  dk  = 0 


x > £ 


(2/tt)  2 k A (k)  cos(kx)  dk  - T 
c o 


Subtracting  (3.8)  from  (3.6)  we  will  have 


[A(k)  - A (k)]  cos(kx)  dk  = 0 


OO  wj 

k [A(k)  - Ac00]  cos  (kx)  dk  - k El— « (ka)  ] A(k)  cos(k>-)  dk 


0 
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Treating  the  right-hand  side  of  these  equations  as  known,  we  copy  the 
solution  of  these  equations  from  Sneddon  [1951,  p.  70]. 


A(k)  - A (k)  = (2f‘%0  [J  (kO  (1-n2)'8 
c L o 


0 


C [l-a(ra)] 


1 


0 


(l-u2)2  du 


0 


A ( t ) cosCenf)  dcjdri  + kf 
1 

I r 

f [l-n(fa)]  A(rJ  cos(t-Cnu)  dc  I U? (^kn)dnj 


0 


0 


where  J (z)  and  J^(z)  are  Bessel  functions.  After  carrying  out  integrations 
in  y an9  u we  obtained  the  following  integral  equation  of  the  second  kind 


(3.9) 


where 


A(k) 


oo 

n(n?->;2)  1 [ n J (vc)  J.  (n) 
o i 

0 


J (n) 

o J 


• [ 1-a (nc) ] A(n)  dq  = A£(k) 


k - k £ , q = , c = a/ 1 , 

(3.10)  A (tc)  = (2/ir)®5  [2n(X+p)/i'?to(X+2p)]  A (k) 

A (k)  (2/i0’j  [2n(A+i.)/(’?t  (A+2)i)  ] A (k)  = JAk)/k 

c o c 1 


in  which  the  last  equality  follows  from  the  classical  solution  for  A^ (k) 

in  the  case  of  t = const. 

o 

When  (3.9)  is  solved  for  A (k)  then  the  displacement  and  stress  fields 
follow  from  (3.1),  (3.3)  and  (3.4).  Along  the  crack  line  (y  = 0)  these 
are  given  by 

OO 

v(f.,0)  [2;  (>-lp)/(,\+2ii)]/toC  = j A(k)  cos  (k-O  d K 


(3.11) 


tv  .(C,0)/t  t(C,0)/t  = - 

J ) ° ° 


k a(tcc)  A(r)  cos(kC)  d« 


where  f.  x/f 
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The  integral  equation  (3.9)  is  non-singular  for  ail  e J 0.  For  e = 0 

we  have  A(tc)  = A (k)  = Ji(k)/k.  It  is  also  clear  that  I-oi(ke)  = 0(e2) 
c * 

for  small  r<<l.  The  smallest  length  crack  may  be  constructed  by  one 
missing  atom.  In  this  case  a = (’  and  e = 1.  Thus  0 e < 1 for  a 
micro-crack  of  100  atomic  length  e = 1/50.  It  is  thus  expected  that 
the  contribution  of  the  integral  in  (3.9)  will  become  appreciable  only 
for  submicroscopic  cracks  of  few  atomic  distances.  In  fact,  this  turned 
out  to  be  the  case  when  (3.9)  was  solved  by  means  of  electronic  computers. 

The  numerical  calculations  were  carried  out  over  a two  Brillouin 
zone,  k = 2ir/a,  by  discretizing  (3.9)  over  150  grid  points.  The  results 
will  be  reported  elsewhere.  Here,  however,  we  give  some  typical  cases. 

In  fact,  we  have  found  that  the  classical  solution  A is  perfectly 
satisfactory  for  2 l/a  >_  40  (still  a submicroscopic  crack). 

The  stress  concentration  for  the  case  when  the  crack  surface  is 
free  of  traction  but  the  plate  is  subject  to  uniform  tension  t,r„  = t 

. c ....  . yy  o 

at  infinity  is  given  by 

(3.12)  P(x)  = [tyy(x,0)/to]  + 1 

The  fact  that  the  classical  solution  A of  the  dual  integral  equation  (3.8) 
satisfies  the  boundary  conditions  extremely  well  for  2-C/a  > 40  can  be 
seen  from  Fig.  4.  For  other  details  and  error  estimates  depending  on  e 
the  reader  is  referred  to  Eringen,  et  al  [1976] . 

4 . COHESIVE  STRESS-FRACTURE  CRITERION 
The  stress  concentration  factor 
(4.1)  C (v)  = (2(7a)_l3  P(0 

is  shown  in  Table  1 for  various  Poisson's  ratio  v = X/2(X4-p)  valid  for 
2£/a  > 100.  It  is  clear  that  0.676  _<  C(v)  _<  0.845.  For  v = 0.25, 

C ~ 0.713  for  2£/a  > 100. 


L__ 


By  means  of  (4.1)  we  make  the  following  very  significant  observa- 
tions : 

(i)  The  stress  field  based  on  nonlocal  theory  has  no  singularity  so 
long  as  a / 0.  In  the  continuum  limit  a— >-0,and  the  classical  square 
root  singularity  occurs. 

(ii)  A maximum  stress  hypothesis  can  now  be  used  to  predict  the 
failure.  In  fact,  we  state  that:  Mien  tVy  max  = tc  = cohesive  stress 
the  fracture  will  occur.  From  (3.1)  it  therefore  follows  that 


(4.2) 


t o2l  = [a/2  C2 (v) ] tc2  - CG 


This  is  the  Griffith  criterion  for  brittle  fracture,  with  extra  benefit 
that  the  Griffitli  constant  Cc  is  now  fully  determined.  Interestingly,  no 
ad  hoc  constant  (e.g.,  surface  energy  y)  occurs  in  (4.2)  and  from  the 
value  of  Cq  it  is  clear  that  it  is  a material  property,  t.e.,  if  is 
known  once  the  cohesive  stress  tc,  lattice  parameter  a, and  the  Poisson's 
ratio  v are  known. 
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(iii)  The  verification  of  the  fact  that  fracture  toughness,  classically 
defined  by  Kj  /it t tn  is  a material  property  led  many  experimentalists 

to  carry  out  long  and  arduous  experiments  (cf..  Freed  et  al.  [1971];  Brown 
and  Strawley  [1966]).  If  (4.2)  is  used  we  see  that 

(4.3)  K =(nC  )~'=  (it  a/2)  ''  t /C(v) 

i G c 

is  indeed  a material  property. 

(iv)  Cohesive  stress  tc  may  be  calculated  for  a given  solid  by  use 
of  (4.2).  Griffith  surface  energy  y appearing  in  (1.2)  has  been  the 
subject  of  a great  deal  of  experimentation.  If  we  equate  (1.2)  to  (4.2) 
we  obtain 

(4.4)  tc,?a  - Kv 
where 

(4.5)  K = 8C?-(v)p/tt(1-v) 

Calculations  may  now  be  carried  out  for  various  materials.  Employing 
the  experimental  values  listed  in  Table  2,  we  have  calculated  tc/E  based 
on  the  nonlocal  theory.  The  results  are  recorded  in  the  next  to  the  last 
column  of  Table  2.  The  entries  in  the  last  column  of  this  table  are 
the  estimates  of  tc/E  based  on  atomic  considerations.  Lawn  and  Wilshaw 
[1975,  p.  160]. 

The  remarkably  close  values  obtained  should  be  considered  to  be 
indicative  of  the  far  reaching  power  of  the  nonlocal  theory. 
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TABLE  1 


Stress  Concentration  factor  at  Crack  Tip 

2( 

vs.  Poisson's  Ratio  ( — - = 100) 

a 
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ABLE  2.  Cohesive  Stress. 


Atomistic  results  are  from  Table  7.1,  p.  160,  Lawn  and  Wilshaw  [1975]. 
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FIGURE  1 


Elastic  plate  weakened  by  a crack 
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FIGURE  2 


> ; 

r* 


Barenblatt  model  assumes  that  cohesive  normal  stress  a(x) 
act  at  the  tip  region  of  the  crack  surface.  The  form  of 
o(x)  is  to  be  determined  to  give  cusps  at  tips. 
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FIGURE  3 


Khristianowich-Dugdale  Model 
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ABSTRACT.  In  the  analysis  of  cracks  lying  in  a compressive  stress 
field,  the  classical  solution  of  elasticity  frequently  yields  unacceptable 
physical  results  - often  predicting  an  overlapping  of  the  crack  faces.  A 
first  order  correction  to  these  solutions  can  be  found  by  admitting  crack 
surface  interference  and  searching  for  a physically  compatible  displacement 
field. 

The  problem  of  a center  (or  edge)  cracked  strip  under  in-plane  bending 
is  solved  from  this  viewpoint.  A necessary  condition  for  a physically  com- 
patible solution  is  shown  to  be  the  vanishing  of  the  stress  intensity  factor 
at  the  crack  tip  in  the  otherwise  compressive  field.  Numerical  results  in- 
dicate that  the  classical  solution  for  the  stress  intensity  factors  at  the 
crack  tip  in  the  tensile  field  underestimates  the  corrected  solution  by 
approximately  ten  percent. 

1.  INTRODUCTION.  Every  so  often  the  simplifying  assumptions  of  the 
classical  linear  theory  of  elasticity  can  lead  to  mathematical  solutions 
which  are  physically  unrealistic.  We  are  familiar  with  the  need  for  retain- 
ing the  non-linear  terms  of  the  strain-displacement  relations  to  account 
for  the  instability  or  buckling  phenomena  observed  in  the  behavior  of  thin 
shells.  Another  type  of  deficiency  arises  in  the  analysis  of  configurations 
involving  cracks  lying  in  compressive  stress  fields. 

A simple  example  illustrating  the  subject  of  this  investigation  is  pro- 
vided by  a rectangular  strip  with  a central  crack  loaded  by  a uniform  uni- 
axial compression  normal  to  the  direction  of  the  crack.  Figure  la.  Assuming 
no  friction  across  the  crack  surfaces,  the  obvious  physically  acceptable 
solution  for  this  problem  predicts  the  tangency  of  the  crack  surfaces  AOB 
and  AO'B  with  a stress  state  of  uniform  compression  acting  throughout  the 
strip  and  across  the  crack  surfaces.  Compare  this  solution  with  that  of 
reversing  the  signs  for  uniaxial  tensile  loading  - an  assumption  consistent 
with  the  superposition  argument  of  classical  elasticity.  Clearly  the  re- 
sulting infinite  compressive  stresses  at  the  crack  tips  and  the  negative 
displacements  predicting  an  overlapping  of  the  crack  surfaces  (Figure  lb) 
arrived  at  by  such  an  argument  is  a physically  unacceptable  solution  of  the 
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Figure  1.  Central  crack  in  rectangular  strip  under  uniaxial  compression, 
cr  = - T. 


The  "overlapping"  problem  illustrated  above  carries  over,  usually  more 
subtly,  to  a variety  of  crack  solutions  when  a portion  of  the  crack  configura- 
tion lies  in  a stress  field  which  is  compressive.  A positive  symptom  of 
overlapping  in  the  vicinity  of  a crack  tip  can  be  inferred  from  the  sign  of 
the  stress  intensity  factor  of  linear  fracture  mechanics.  If,  for  example, 

Kj  (the  conventional  Mode  I stress  intensity  component)  is  negative,  then 
there  exists  local  overlapping  at  the  crack  tip. 

A plan  of  modifying  the  classical  solution  by  tolerating  crack  closure 
but  no  overlapping  is  adopted  in  this  paper.  The  problems  corresponding  to 
internal  and  edge  cracks  in  a strip  under  in-plane  bending  are  analyzed  from 
this  viewpoint  and  the  "error"  in  the  classical  solutions  is  assessed. 

2.  CENTRAL  CRACK  IN  AN  INFINITE  SHEET  UNDER  BENDING.  First,  we  con- 
sider the  problem  of  a crack  of  length  2L  with  center  at  Z0  in  an  infinite 
sheet  under  in-plane  bending  (Figure  2).  When  Z0  = 0 , the  crack  is  centrally 
located  with  respect  to  the  applied  load  and  crack  tips  A and  B obviously 
lie  in  compressive  and  tensile  stress  fields,  respectively.  We  shall  now 
show  that  both  the  classical  and  the  modified  solutions  of  this  problem  can 
be  found  by  the  Muskhelishvili  [1]  method  of  analysis. 
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Figure  2.  Crack  in  an  infinite  sheet  under  in-plane  bending. 

The  Muskhelishvili  analysis  depends  on  the  determination  of  two 
analytic  stress  functions  <t>(z)  and  t^(z)  with  the  stresses  and  displacements 
defined  as 

°y  + ax  = + 

ay  ' ax  + 2iTxy  = 2 [ ^ " Cz)  + ♦'(*)]  (1) 

2y(u  + iv)  = Kip(z)  - zip' (z)  - ip(z) 

where  primes  denote  differentiation  and  bars  complex  conjugates.  The  con- 
stants u and  < are  defined  as  p = E/2(l+v)  and  < = 3-4v  (plane  strain)  and 
k = (3-v)/(l+v)  (plane  stress)  where  E and  v are  Young's  modulus  and  Poisson's 
ratio,  respectively. 

For  a plate  with  no  crack,  the  stress  functions 


yield  the  stress  distribution 


a =0,  t =0,o  = -Ty  (3) 

y ’ xy  x J 

which  is  of  course  the  desired  loading  for  large  |z|. 

The  physical  region  in  Figure  2 can  be  described  conveniently  by  the 
mapping 

z = <o(C)  = ZQ  + *ia(L/2)U  + C_1)  (4) 

The  unit  circle  and  its  exterior  in  the  ?-plane  are  mapped  into  the  crack 
and  its  exterior  in  the  z-plane.  In  particular,  £=1  maps  into  the  crack  tip 
A and  5 = -1  maps  into  the  crack  tip  B. 

The  stress  functions  $(z)  and  if)(z)  can  now  be  considered  as  <Kc)  and 
<HS)  where  <t>'(z)  now  corresponds  to  cf> ’ (c)/w'  (5) , etc.  Using  the  well-known 
continuation  arguments  of  Muskhelishvili , the  crack  is  traction-free  if  we 
set 


♦ CO  = - <Kl/d  - uCl/CH'CO/u'CO  (5) 

and  the  extended  definition  of  <f>(?)  leads  to  a function  continuous  across  the 
unit  circle.  On  the  other  hand,  from  (2)  the  loading  conditions  at  infinity 
requi re 

<KO  -*■  iTz2/8  ->  iT(L/32)  [U2l0lC2  + (6) 

HO  -*■  - iTz2/8  ->  - iT(L/32)  [L«,2l“c2  + 4zqJ1101C 
for  large  ( £ | . 

Conditions  (5)  and  (6)  are  satisfied  by  choosing 

<HO  = [iTL2/32 ] j)l2ia^2  + 4(zo/L)ilia5  - [l_2i®  - 2]C"2  (7) 

+ [8i  UQ/L)  sin  a + 4(zq/L)  £~ia]  5“ 1 
and  this  completes  the  formal  solution. 

3.  THE  CLASSICAL  SOLUTION  WHEN  zn  = 0.  The  classical  solution  for 
the  centrally  located  crack,  z0  = 0,  will  first  be  considered.  The  crack 
tip  A lies  in  apparently  a compressive  field  and  we  can  anticipate  a physi- 
cal incompatibility  of  the  solution. 


The  stress  intensity,  ; , at  the  crack  tip  A in  general  will  be 
made  up  of  Modes  I and  II  and  can  be  calculated  from 


i = 2*'(l)[£la co"(l)]‘  1/2 


= - T(L3,  2/2)sin2a(sin  a + i cos  a) 


whence 


,,(1)  .3/2  . 3 

K ^ • = - (T/2)  L sin  a 


..(1)  3/2  . 2 

= - (T/2) L sin  a cos  a 


Similarly,  at  crack  tip  B (corresponding  to  5 = - 1), 

„(1)  .3/2  . 3 

K1B  = (T//2)L  sin  a 

(10) 

,,(13  3/2  . 2 

K2B  = ” *-T'2)L  Sln  a cos  a 

A clue  to  the  unacceptability  of  the  solution  is  negativeness  of 

i.  /V 

In  order  to  examine  the  physical  compatibility  of  the  displacements  of 
the  crack  surfaces,  we  introduce  a (£,ri)  coordinate  system  where  £-  and  ri 
are  along  and  normal  to,  respectively,  the  crack  direction.  Then 


Uj.  + i u^  = l la(u  + iv)  = {.”la(K  + 1) <f (°) /2y 


for  the  crack  boundary  where  a = l are  points  on  the  unit  circle  in  the 
C-plane.  The  condition  for  no  "overlapping"  of  the  crack  boundaries  can  be 
written  as 

u^(e)  - u^(-e)  >_  o o £ e <_  it  (12) 

When  z =0, 
o ’ 

un(0)  - u^(-9)  = (k  + l)TL2sin  a sin  29[cos  2a  - l]/16y  (13) 

which  (except  for  the  trivial  cases  a - o,  it)  clearly  violates  the  no  over- 
lapping condition  (12)  in  the  interval  o < 9 < ir/2. 
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4.  DETERMINATION  OF  A PHYSICALLY  ACCEPTABLE  SOLUTION.  The  plan  for 
determining  a physically  acceptable  solution  depends  on  admitting  crack 
closure  over  segments  of  the  crack  without  overlapping.  If  the  crack  tip  is 
involved  in  the  region  of  overlapping,  as  is  the  case  in  the  present  problem, 
a necessary  condition  for  an  acceptable  solution  can  be  expressed  in  terms 
of  the  stress  intensity  factors  from  a consideration  of  the  local  stress 
and  displacement  fields. 

Consider  the  crack  tip  A and  the  displacements  u^  and  u^  to  the  first 
order  of  the  local  crack  tip  expansion.  A necessary  condition  for  no  local 
overlapping  can  easily  be  shown,  Kj  ^ 0,  from  a consideration  of  u^.  If, 
in  addition,  we  assume  crack  closure  in  the  neighborhood  of  A,  then  an  must 
be  non-tensile  across  this  interval.  Therefore,  a necessary  condition  for  a 
physically  acceptable  solution  is  Kj  = 0 at  A.  No  claim  as  to  the  sufficiency 
of  this  condition  can  be  made  as  the  stress  intensity  reflects  only  the 
dominant  term  of  the  local  solution.  A solution  arrived  at  on  this  basis 
must  still  be  tested  for  its  overall  consistency. 

In  the  present  case,  we  consider  zQ  as  undetermined  and  impose  the 
vanishing  of  at  A.  Since 

4>'(1)  = - iT(L2/4)  | sin2a  + (2/L)  (sin  a)  Im  zq  j (14) 

it  follows  that  Kj  = 0 at  A if  we  choose 

Im  z = - (L/2)sin  a (15) 

Although  there  are  no  restrictions  on  Re  z , we  choose  zQ  so  that  the  crack 
passes  through  the  origin  of  coordinates,  ?hus 


zQ  = - (L/2)2la  (16) 

With  this  choice  of  zq,  we  reexamine  the  non-overlapping  condition  (12). 
On  the  crack. 


u = (<  + 1)TL2  <4  sin3  a sin  0(1  - cos  6) 

2 2 / 

+ cos  a(l  + 2 sin  a)(l  - 2 sin  0-2  cos  0)  > /32y, 


thus , 

u^(0)  - u (-0)  = (k  + l)TL2sin3a  sin  0(1  - cos  0)/4p  (18) 

which  clearly  satisfies  (12)  for  0 <_  0 <_  u and  hence  is  a physically 
acceptable  displacement  field. 

The  stress  intensity  factors  in  the  present  case  are 


m 


K 


,(2) 

klA 

(2) 

IB 

f(2) 

2B 


■ • 


-.3/2  . 3 
TL  sin  a 


3/2  2 

- TL  7 sin  a cos  a 


(19) 


Furthermore,  it  is  easily  verified  that  the  forces  normal  to  the  segment  AC 
in  Figure  2 are  compressive. 

It  is  clear  that  the  present  solution  can  be  considered  as  a physically 
acceptable  solution  for  a central  crack  along  the  segment  BC  where  closure 
occurs  on  the  segment  AC.  We  do  assume,  of  course,  that  the  frictional 
properties  of  the  crack  surfaces  are  consistent  with  a continuous  displace- 
ment solution  along  AC,  i.e.  closure  without  slippage. 

A comparison  with  the  previously  derived  classical  solution  for  a 
centrally  located  crack  can  now  be  made  by  observing  the  change  in  the  stress 
intensity  calculation  at  point  B.  The  crack  AC  corresponds  to  a crack  length 
of  2L  if  an  effective  half  crack  length  of  2L/3  is  used  in  the  calculation 
of  (19).  Thus,  the  "corrected"  stress  intensity  factors  at  B are 


K1B  = T(2L/3)3/2sin3a 

K2B  = - T(2L/3)3/2sin2a  cos  a 


(20) 


Since 


•W'w  ■ K2B/K2B)  " nw)3/2 


(21) 


the  classical  estimate  of  the  stress  intensity  factor  at  B is  in  error  on 
the  non-conservative  side  by  approximately  nine  percent. 

5.  CENTRAL  CRACK  IN  A FINITE  STRIP  UNDER  BENDING.  We  consider,  now, 
the  more  difficult  problem  of  a central  crack  in  a strip  of  finite  width 
under  bending.  Figure  3,  where  the  solution  cannot  be  found  in  closed  form 
and  the  previous  arguments  must  be  carried  out  numerically.  For  the  con- 
figuration in  Figure  3,  Benthem  and  Koiter  [2]  have  estimated  the  crack 
tip  stress  intensity  factors  at  B for  the  classical  solution  of  the  problem 
by  using  an  effective  asymptotic  argument. 
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Figure  3.  Central  crack  in  a strip  under  bending. 

The  solution  was  carried  out  using  the  MMC  (Modified  Mapping  Collocation) 
method  combined  with  finite  elements  [3,4],  This  plan  is  based  on  "partition- 
ing" the  region  and  using  a representation  of  the  solution  appropriate  to 
each  sub-region.  The  boundary  conditions  along  with  appropriate  "stitching" 
conditions  between  the  representations  must  be  satisfied  by  the  solution. 

The  details  of  this  approach  have  been  previously  documented  and  will  not  be 
repeated  here. 

The  partitioning  plan  is  indicated  in  Figure  3.  The  region  MRSN  was 
described  using  the  mapping  function 

z = zq  + i(«,/2)U  + 1/5)  (22) 

which  clearly  maps  the  unit  circle  in  the  5-plane  into  the  crack  AB.  A 
series  representation  of  the  solution  was  chosen  in  the  corresponding  para- 
meter region  and  traction- free  conditions  on  the  crack  were  enforced  by  the 
continuation  argument,  e.g.,  F.quation  (5).  The  boundary  conditions  on  RS 
and  MN  and  the  stitching  conditions  on  RM  and  SN  were  imposed  by  the  colloca- 
tion arguments  outlined  in  [4],  In  the  complementary  regions  (the  shaded 


26 


areas  in  Figure  3)  a finite  element  representation  of  the  solution  was  taken. 
Imposed  on  this  representation  were  the  appropriate  stitching  conditions, 
traction-free  boundary  conditions  and  end  loading, 

ax  = - (3M/2b3)y  (23) 

Again,  we  seek  a value  of  z0  such  that  the  stress  intensity  Kja  = 0 
and  the  crack  displacements  and  the  forces  on  AC  are  physically  compatible 
with  our  argument.  It  was  found  that  z0  can  be  determined  quite  readily  by 
iteration.  From  the  infinite  sheet  solution,  it  is  evident  that  for  small 
il/b  ratios,  z0=£/2.  With  this  as  a guide  for  the  first  approximation, 
only  a few  trials  were  required  to  find  the  proper  value  of  z0  for  successively 
increasing  values  of  l[ b. 

The  numerical  results  are  presented  in  Table  1.  Again  the  results  are 
to  be  compared  with  the  classical  solution  for  the  central  crack,  z0  = 0. 

The  effective  half  crack  length,  L,  is  evidently 

L = % + | zj  (24) 

Table  1 

"Corrected  Stress  Intensity  Factors,  Kjg, 
for  Central  Crack  in  Strip  under  Bending 


** 


L/b 

t/b 

zo/b 

OA/b 

kib 

kib 

kib 

Mb'3/2 

Mb-3/2 

kib 

0.1 

0.067 

-0.033 

0.033 

0.0259 

0.0237 

1.09 

0.2 

0.133 

-0.067 

0.067 

0.0733 

0.0672 

1.09 

0.3 

0.200 

-0.100 

0.100 

0.136 

0.124 

1.10 

0.4 

0.270 

-0.130 

0.140 

0.213 

0.193 

1.10 

0.5 

0.340 

-0.160 

0.180 

0.304 

0.276 

1. 10 

0.6 

0.414 

-0.186 

0.228 

0.417 

0.379 

1.10 

0.7 

0.492 

-0.208 

0.284 

0.567 

0.516 

1.10 

0.8 

0.574 

-0.226 

0.350 

0.796 

0.727 

1.10 

0.9 

0.668 

-0.232 

0.432 

1.280 

1.163 

1.10 

1.0 

0.763* 

-0.237* 

0.526* 

•Extrapolated 
••Benthem  and  Koiter  [2] 

It  is  interesting  to  compare  the  present  results  with  the  classical 
results  of  Benthem  and  Koiter.  Within  one  percent,  the  classical  solu- 
tion underestimates  the  K^g  values  by  nine  percent  for  all  values  of  L/b. 

6.  MODIFICATION  OF  THE  ASYMPTOTIC  APPROXIMATION.  In  [2],  Benthem  and 
Koiter  introduced  a non-dimensional  factor  K by  writing 
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Ky  = K 3LM(aL/b)1/2/2(b3  - L3)  (25) 

where  K is  a polynomial  in  L/b.  An  approximate  solution  for  K was  found  by 
order  of  magnitude  considerations  of  the  two  limiting  cases,  L/b  -*  0 and 
a/b  ->  0. 

The  modifications  of  their  arguments  for  the  "corrected"  solution  for 
L/b  -*•  0 can  now  be  carried  out  by  using  our  solution  for  the  central  crack 
in  an  infinite  sheet.  In  particular,  if  the  order  of  magnitude  considerations 
of  [2]  are  modified  by  Equation  (20),  then,  at  the  crack  tip  B, 

K10  + (2/3)3/2[3ML3/2/2b3][l  + 0(L4/b4] 

18  (26) 

for  L/b  0 

From  a comparison  of  Equations  (25)  and  (26) , 

K - (2/3)  3/2 [1  + (1/2) (L/b)  + (3/ 8) (L/b) 2 - (11/16) (L/b)3 

4 4 C27) 

+ 0 (L  /b  J for  L/b  + 0 

For  the  second  limiting  case,  a/b  ->  0,  by  using  the  anti-symmetry  of 
the  classical  problem  and  the  "edge  dam"  solution,  the  authors  of  [2]  found 

K -*  2/ (it2  - 4) 1/2  = 0.826  for  a/b  + 0 (28) 

Unfortunately,  due  to  the  non-linearity  of  our  present  solution  no  such 
limit  can  be  rigorously  argued.  On  the  other  hand,  a reasonable  estimate  of 
this  limit  can  be  found  by  extrapolation  of  the  data.  From  Table  1,  the 
segment  0A  can  be  extrapolated  as  0A  ■+■  0.52b  as  a/b  -*  0.  Furthermore,  the 
stress  distribution  a along  the  centerline  from  A to  the  edge  is  very  nearly 
linear.  From  equilibrium  conditions,  it  can  be  argued  that  the  local  stress 
at  B is  nine  percent  higher  than  in  the  classical  case.  Thus, 

K (1.09)  (0.826)  for  a/b  -*•  0 (29) 

(Although  (29)  is  an  extrapolated  estimate,  it  was  verified  that  reasonable 
variations  in  the  approximation  altered  this  result  by  no  more  than  one  percent.) 

Therefore,  the  simplest  polynomial  interpolation  between  these  asymptotic 
results  yields 

K = (2/3) 3/2 [ 1 + (1/2) (L/b)  + (3/ 8) (L/b) 2 - (11/16) (L/b) J 

4 (30) 

+ .464  (L/b J ] 


3/2 

Equation  (30)  is  identical  with  the  K in  [2]  if  (2/3)  were  replaced  by  1/2. 

7.  EDGE  CRACK  IN  A STRIP  UNDER  BENDING.  At  about  the  same  time  as 
the  author's  solution  [5],  Paris  and  Tada  [6]  considered  the  solution  for 
an  edge  crack  in  a strip  under  bending  again  allowing  for  interference  of 
segments  of  the  crack  surfaces,  Figure  4. 


Figure  4.  Edge  crack  in  strip  under  bending. 

It  is  obvious  physically  that  for  C/W  <_  1/2,  assuming  no  friction 
between  the  crack  surfaces,  the  admissible  solution  is  one  which  predicts 
the  strip  is  in  uniform  bending  with  the  crack  surfaces  interferring  and 
carrying  a compressive  load.  For  C/W  _>  1/2,  it  is  also  clear  that  the 
solution  is  identical  to  our  results  for  the  central  crack  with  a modified 
interpretation  of  the  parameters. 

In  Paris  and  Tada's  analysis,  the  crack  tip  stress  intensity,  K,  was 
approximated  by 

K-  = G(C/W)H(C/W)  \AcT  (31) 

where 

G (C/W)  = (2/3)3/2(2C/W) (1  - W/2C)3/2  (32) 
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based  on  the  solution  for  a central  crack  in  an  infinite  sheet  under  bending. 
(Note  that  in  Equation  (31),  the  alternate  introduction  of  y^rT'in  the 
definition  of  stress  intensity  factors  has  been  made.)  The  function 
H(C/W)  was  taken  as  the  correction  for  the  effect  of  the  finite  width  of 
the  strip.  Paris  and  Tada  did  not  calculate  H(C/W)  exactly,  instead,  they 
assumed  an  approximation  based  on  the  finite  width  correction  for  a center 
cracked  finite  width  strip  under  tension.  Their  numerical  results  are 
listed  in  Table  2. 

The  results  which  we  have  derived  can  be  applied  with  the  following 
changes  in  notation. 


L = a1  = C - W/2 

b = W/2  (33) 

a = b - L 

L/b  = 2 (C/W)  - 1 = X 
M = (2/3)b2a 
Kx  = K- 

Then, 

K-/o  /7c  = R(A)A3/2  Vl  - A/(l  - X3)  Vl  + A 
where  (34) 

R(A)  = (2/3) 3/2[ 1 + A/2  + 3A2/8  - 11A3/16  + .464  4] 


1 / 1 

A comparison  of  the 

results  is 

shown  in  Table  2. 

Table  2 

Values  of  K /a  \/rC 

| V 

\ 

C/W 

X 

Equation  (31) 

Equation  (34) 

1 it 

n 

0.50 

0.0 

0.0000 

0.0000 

0.55 

0.1 

0.0165 

0.0164 

i 

% 

0.60 

0.2 

0.0453 

0.0445 

0.70 

0.4 

0.129 

0.118 

• 

0.80 

0.6 

0.259 

0.217 

L ‘A 

t 30 

[ t* 


The  approximation_of  H(C/W)  used  by  Paris  and  Tada  appears  to  exaggerate 
the  stress  intensity  K for  the  deeper  cracks.  For  cyclic  bending  of  an 
edge  cracked  strip,  the  moment  M contributes  to  the  £rack  opening  after  the 
crack  has  reached  the  half  width  of  the  strip.  The  K contributes  for 
further  crack  growth  can  then  be  determined  from  Equation  (34). 


8. 


OBSERVATIONS.  The  problem  of  "crack  overlapping"  occurs  in  several 
of  the  classical  analyses  found  in  the  literature.  The  results  of  this 


investigation  would  appear  to  indicate  that  the  errors  so  introduced  are 
sufficient  to  warrant  a more  careful  consideration  of  such  solutions. 
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SUMMARY 


A finite  element  formulation  is  described  for  problems  with  solution 
functions  known  to  have  local  r*  variation  (s),  0<X<l,and  thus  singular  gradi- 
ents. Special  3-node  triangular  elements  encircle  the  singularity  and  focus  to 
share  a common  node  at  the  singular  point.  The  shape  function  of  each  triangle 
has  the  appropriate  r^  mode  and  a smooth  angular  mode  expressed  in  element 
natural  coordinates.  As  with  standard  elements,  the  unknowns  are  the  nodal 
values  of  the  function.  Even  if  the  precise  angular  form  of  the  asymptotic 
solution  is  known,  the  formulation  makes  no  attempt  to  embed  it,  but  instead 
piecewise  approximates  it.  This  allows  assembly  of  the  element  coefficient 
matrix  using  standard  procedures  without  nodeless  variables  and  bandwidth 
complications . 

The  conditions  of  continuity,  low  order  solution  capability,  and  accurate 
numerical  integration  of  the  singularity  element  are  discussed  with  a view 
towards  establishing  the  general  range  of  applicability  of  the  formulation. 
Numerical  applications  to  the  elastic  fracture  mechanics  problems  of  composite 
bondline  cracking  and  crack  branching  are  discussed. 

INTRODUCTION 


We  are  considering  here  the  problem  of  attaining  accurate  numerical 
representation  of  a function  <J>  (x,y)  when  near  discrete  points  in  the  domain  <J> 
varies  as  r\  0<X<  1.  ^Standard  shape  functions  cannot  properly  model  the 
singular  gradient  of  r so  our  approach  has  been  to  design  a special  singularity 
element.  Beyond  embedding  the  proper  singularity  into  the  shape  function,  the 
usual  questions  of  interelement  continuity,  constant  state  representation,  and 
accurate  numerical  integration  are  addressed. 

Interelement  continuity  should  be  maintained1  for  $ and  its  derivatives 
up  to  one  order  less  than  that  occurring  in  the  governing  volume  integral, 
denoted  by  I,  of  the  problem.  Subsequently,  it  will  be  shown  that  the 
singularity  element  has  <)>  interelement  continuity  but  no  guaranteed  con- 
tinuity of  4)  gradients  across  edges.  Strictly  speaking  then,  it  is  limited 
to  problems  where  I = I (<J>,  3<(>/3  x.).  For  example,  this  is  the 
case  in  the  potential  energy  formulation  of  elasticity  where  the  governing 
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functional  of  displacement  involves  only  first  order  derivatives.  The 
virtual  work  formulation  of  plasticity  is  another  such  case  with  <t> 
representing  the  displacement  increment. 

The  other  finite  element  convergence  criterion  * is  that  an 
element  should  be  capable  of  representing  fields  with  constant  values 
of  <t>  , or  derivatives  of  <i>  up  to  the  order  occurring  in  I.  This  is 
necessary  because  in  the  limit  of  vanishing  element  size,  and  its 
derivatives  should,  within  the  element,  equal  the  pointwise  constant 
values.  From  a practical  standpoint  the  constancy  conditions  are 
important  only  when  constant  state  conditions  exist  over  the  finite 
subdomain  occupied  by  the  element.  The  boundary  conditions  of  a 
singularity  problem  can  cause  smooth  as  well  as  singular  <b  variations 
near  the  singular  point.  The  constancy  capability  of  the  elements  at 
the  singular  point  is  important  only  if  the  smooth  terms  are,  on  an  element 
average  basis,  comparable  in  value  to  the  singular  terms.  The  element 
introduced  below  has  modes  of  the  constant  and  r^  type.  It  does  not 
have  the  polynomial  terms  necessary  to  represent  non-zero  constant 
derivatives.  Since  the  singular  mode  dominates  the  uniform  mode  as 
the  singularity  is  approached,  the  lack  of  the  latter  mode  is  of  diminishing 
consequence  as  element  size  is  reduced,  and  thus  convergence  is  achievable 
in  this  sense.  However  it  is  clear  that  the  element  is  not  suited  for  problems 
without  an  "active"  singular  ity. 

FORMULATION 

The  element  described  here  is  a generalization  of  the  singular 
Z 1/2 

element  suggested  for  analysis  of  the  r ' elastic  crack  tip  singularity. 
The  element  is  a 3 node  triangle,  and  has  one  of  its  nodes  at  the  singular 
point.  The  power  form  variation  is  chosen  in  the  direction  away  from  the 
singular  point;  low  order  smooth  variation  is  chosen  in  the  angular  direction. 
Figure  la  illustrates  the  modeling  with  one  of  a necessary  group  of  triangles 
at  the  singular  point,  node  I.  The  shape  function  is  developed  in  terms  of  the 
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oblique  coordinates  ?,  n which  vary  over  the  range  [ 0,  ljwithin  the 
element.  The  radial  edges  correspond  to  r)  = 0,  1.  The  edge  ?=  0 is 
actually  a point  - the  singular  point-and  the  far  transverse  edge  is  § = 1. 

The  transformation  to  cartesian  coordinates  follows 

x = x 1 ( 1 - §)  + x/"'  5(  1 - n)  + x K ( U 

It  is  straightforward  to  show  that  ? is  always  a linear  function  of  r 
times  a trigonometric  function  of  angular  orientation  within  the  element 
and  that  ri  is  solely  a trigonometric  function  of  angle.  As  an  example, 
the  isosceles  triangle  of  Fig.  lb  has  the  transformation  equations 

§ = (r  cos  9)  / x = x/x 

o o 

(2) 

r|  = (tan  8/tana  + 1)  /2  = (y/x*x  /y  + l)/2 

o o 

With  ? being  a linear  function  of  r,  <t>  varies  as  rX  when 
§X  terms  are  chosen  in  the  shape  functions;  such  a choice  yields  the 
interpolation  function 

i = <t>1  ( l - SX)  + SX  (l  - n)  + «*K  ?X  t)  (3) 

For  the  isosceles  triangle  this  corresponds  to 

<t>  - <t>  (1  - (x/x  )^)  + 1/2  (1-  y /x  • x /y  ) (x/x  )X 

o o o o 

(4) 

+ 1/2  (1  + y/x  • x /y  )(x/x  )X 

o o o 

By  using  a group  of  these  elements  about  the  singularity,  the 
angular  form  of  the  asymptotic  solution  is  approximated  in  a piecewise 
smooth  fashion.  The  singular  radial  variation  is  embedded  throughout 
the  region  occupied  by  the  elements. 

On  the  radial  edges  ^ is  a two  parameter  function,  e.g.  on  IJ 
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(5) 


6 = 4>l  + (4  J - tl)  ?X 


so  that  there  is  continuity  of  </>  on  these  edges.  On  JK  ^ is  a linear 
function  of  position  which  guarantees  continuity  with  an  element  such 
as  the  bilinear  isoparametric.  Derivative  continuity  across  element 
dges  is  not  guaranteed  so  that,  as  previously  discussed,  the  element 
strictly  applies  only  to  those  problems  whose  governing  integrals  are 
independent  of  second  and  higher  order  <b  derivatives. 

The  element  is  capable  of  representing  a constant  i>  condition 
as  can  be  seen  by  substituting  a constant  for  the  nodal  values  in  the 
interpolation  function  and  observing  that  <i>  then  equals  the  constant. 
Without  a linear  term  in  the  shape  function  the  constant  first  derivative 
condition  cannot  be  met.  In  analysis  of  deformable  solids  where  <t> 
w mid  be  the  displacement  function,  situations  such  as  rigid  rotation 
and  uniform  thermal  expansion  correspond  to  a linear  <k>  mode.  The 
element  cannot  directly  accommodate  these  cases,  but  by  choosing  a small 
enough  element  the  singular  mode  will  dominate  the  exact  solution  making 
the  exclusion  of  the  linear  mode  inconsequential. 

The  singular  nature  of  the  d gradients  does  not  preclude  the 
possibility  of  accurate  numerical  integration  in  forming  the  coefficient 
matrix.  It  is  assumed  from  the  outset  that  the  rX  variation  gives  rise 
to  an  integrable  singularity.  Standard  methods  of  integration  have  been 
developed  for  polynomial  variations  so  that  these  can  be  used  only  for 
the  angular  integration.  In  general  the  problem  is  to  integrate  terms  of 
the  form 


f 


0 


; 


0 


f (?)  5 d§ 


g (n)  dn 


(6) 


The  determinant  of  the  Jacobian  , b(x,  y)/J>(?,  r|),  accounts  for  the  factor  ? 
of  the  inner  integrand.  For  the  examples  below  a 2 -point  Gauss  rule  was 
used  for  the  r>  integration.  The  form  of  f(£)  must  be  scrutinized  before 
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choosing  a ? integration  rule.  For  elasticity  the  governing  integral  is  a 
quadratic  function  of  the  shape  function  first  derivatives  and  this  results 
in 


f = ? 


2X 


- 2 


(7) 


Hence 

1 1 2X  1 

/ f 5 d?  = / § d§  = 1/2X  (8) 

0 0 

For  the  elasticity  examples  below,  the  numerical  technique  employed  to 
achieve  precisely  the  result  (8)  was  a specialized  1-point  rule:  one 

integration  station  was  used  at  location  5 - (2X)  ^ ^ and  its  weight 

was  unity.  It  is  easily  appreciated  that  standard  methods  of  integration 
can  be  very  much  in  error  for  this  problem,  particularly  for  X < 0.  5. 

Hence,  generally  speaking,  detailed  invest igat ion  of  f (§)  is  required 
for  design  of  an  adequate  integration  procedure. 

EXAMPLES 

The  examples  are  problems  of  elastic  fracture  mechanics.  The  finite 

element  approach  employed  was  that  based  upon  the  principle  of  minimum 

potential  energy,  so  that  <t>  of  the  last  section  now  stands  for  the  displacement 

vector  function.  The  first  problem  is  the  bimater ial  elastic  strip  with  a 

pressurized  crack  normal  to  and  terminating  at  the  bondline.  'rhe  geometry  is 

illustrated  in  Fig.  2.  The  material  on  the  left  is  cracked  and  designated  as 

material  1 with  shear  modulus  p and  Poisson's  ratio  V^;  material 

2 to  the  right  has  properties  p^,  * Crack  length,  plate  width,  and 

height  are  related  by  a/b  = a/h  = 1/9.  The  left  end  of  the  crack  being 

surrounded  completely  by  one  material  is  a singular  point  with  displacement 

1/2 

varying  as  r . The  bondline  crack  tip  has  a singularity  dependent  upon 
the  bimaterial  elastic  properties.  Displacement  varies  as  r^-  with  X a 
function  of  p.  /p,  and  also  the  type  of  planar  deformation,  i.  e.  plane  stress 

vs.  plane  strain.  The  examples  here  are  plane  strain  and  the  material 

combination  is  aluminum-epoxy.  For  aluminum  p = 3.846  x 10  psi, 
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V = 0.  3;  and  for  epoxy  p.  = 0.  1667  x 10  psi,  V = 0.35.  With  aluminum 
as  the  cracked  material  m=  = 043  and  X = 0.  1752.  When  epoxy 

is  the  cracked  material  m = 23.08  and  X = 0.6619. 

Figure  3 shows  the  mesh  used  in  the  crack  location.  Symmetry 
allowed  modeling  just  the  upper  half  of  the  strip.  Isosceles  triangle  s 
with  a radial  dimension  of  a/ 100  and  angular  extent  of  15°  were  used 
as  singularity  elements  about  each  crack  tip.  Of  course,  about  each 
tip  the  appropriate  value  of  X was  used  to  generate  the  element  stiffnesses. 

The  radial  dimension  of  the  singularity  elements  is  a crucial  aspect  of 
the  finite  element  model.  The  singularity  elements  should  be  entirely 
within  the  region  where  displacement  is  accurately  represented  by 
the  r^  form.  The  crack  opening  displacement  data  from  available 

3 

singular  integral  equation  solutions  were  used  to  establish  the 
suitability  of  the  radial  dimension  a/  100.  When  there  is  no  basis  for 
judgment  of  the  range  of  dominance  of  the  leading  power  term  in  the  full 
solution,  a convergence  study  must  be  conducted  by  successively  decreasing 
element  size  to  establish  accuracy  estimates  of  the  singularity  solution. 

Bilinear  isoparametric  elements  were  used  to  model  the  plate 
away  from  the  singularities.  The  total  mesh  involved  429  nodes  and  433 
elements.  The  forces  specified  to  be  acting  on  the  crack  face  nodes  were 
calculated,  in  terms  of  the  uniform  pressure  p,  consistent  with  the  element 
shape  functions.  Thus,  the  singularity  element  node  on  the  crack  face  had 
an  applied  normal  force  per  unit  thickness  equal  to  . 01  pa/(l  + X). 

Three  features  of  the  solutions  to  be  discussed  are  the  angular 
distribution  of  stress  about  the  bondline  crack  tip,  the  crack  opening 
behavior  near  the  bondline,  and  the  stress  intensity  factors.  The  angular 
variation  of  the  normalized  stress  a / P through  the  ring  of  bond  tip 

yy 

singular  elements  is  given  in  Figure  4.  Data  are  given  for  both  p^/p^ 
combinations.  Along  with  the  finite  element  data  at  the  twelve  discrete 
midpoint  angles,  singular  integral  equation  (SIE)  data  are  given  at  angles 
of  0,  90  and  180°  and  r = 0.005a.  The  first  striking  charar'  eristic  of 
the  distribution  is  the  discontinuity  of  stress  across  the  bondline. 
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Independent  of  which  material  is  cracked,  at  90°  - the  bondline  - the 
aluminum  is  stressed  higher  than  the  epoxy.  Hence,  when  the  epoxy  is 
cracked  0 (90  ) exceeds  a (90  ),  and  just  the  opposite  when 

yy  yy 

the  aluminum  is  cracked.  There  is  very  good  agreement  between  the 

SIE  and  finite  element  solutions  with  the  exception  of  the  90  values 

for  m = 0.043.  The  finite  element  mesh  is  perhaps  too  coarse  in  the 

angular  sense  to  accomodate  the  large  gradient  in  the  range  90-  180° 

for  m = 0.  043,  so  that  mesh  refinement  might  improve  this  deviation. 

In  Figure  5 the  normalized  crack  opening  displacement  u /a 

is  plotted  as  a function  of  distance  from  the  bondline  crack  tip  to 

r/a  = 0.  16  for  the  two  p^/p^  cases.  The  data  corresponds  to  a unit 

value  of  crack  face  pressure.  The  finite  element  data  appear  in  discrete 

fashion  in  the  plot  and  for  comparison  purposes  the  SIE  solutions  are 

presented  and  are  represented  by  the  solid  curves.  There  is  excellent 

agreement  between  the  solutions  for  m = 23.  08,  and  this  is  true  over 

the  entire  crack  face,  0 <_  r/a  <2.  While  the  SIE  and  finite  element 

data  agree  at  r/a  = 0.  01  for  m = 0.  043,  the  solutions  differ  by  5-10% 

over  most  of  the  crack  face,  including  near  the  embedded  end.  There 

is  a dramatic  difference  in  the  opening  behavior  local  to  the  bondline 

for  the  two  cracked  cases.  The  SIE  curves  demonstrate  the  behavior 

0.  175  0. 662 

which  is  expected  from  the  r and  r asymptotic  displacement 

solutions.  With  epoxy  bonded  to  cracked  aluminum  there  is  a rapid  gradient 
in  opening  which  is  intuitively  consistent  with  the  stiffness  mismatch. 

The  intersection  of  the  two  curves  is  near  r/a  = 0.  01  , the  location  of  the 

first  finite  element  node,  and  the  opening  displacements  u /a  there  are 

- 6 _ & ^ 

0.193  x 10  for  m = 23 . 08,  and  0.222  x 10  for  m = 0. 043 . 

The  stress  intensity  factor,  generalized  for  both  the  embedded  and 

bondline  crack  tips  is  defined  as 

K=lim/2r1^a  (r,o)  (9) 

yy 

r->  o 7 1 

To  deduce  K from  the  displacement  data, the  following  equation  was  used 


I 


N 


K = 2/2  X fx*  u (r,TT)/r 

y 


(10) 


ic. 


The  modulus  p.*  is  defined  from  the  relationship 


u (r,TT)  = r <7  (r,o)/2Xtx* 

y yy 


(ii) 


) i 

r* 

ir.f 


p#  is  an  algebraic  function  of  the  bimaterial  constants  and  the  eigenvalue  X. 

For  the  plane  strain  homogeneous  material  case,  m=l,  p*  is  equal 
to  p/2(  1 -v). 

From  eqn.  (10),  the  stress  intensity  factor  at  the  embedded  tip, 

K/pfa  , computed  from  the  finite  element  data  at  r/a  = 0.01  was  found  to 

equal  0.  89  when  m=23.  08,  and  1.  52  when  m-0.  043.  For  a homogeneous 

plate  the  result  is  1.  00,  and  this  shows  the  degree  to  which  the  aluminum  reduces 

the  severity  of  the  singularity  in  the  cracked  epoxy,  and  how  much  more  severe 

the  singularity  is  in  aluminum  when  epoxy  is  bonded  to  it.  The  values  for 

K / pa  ^ at  the  bond  line  crack  ends  are  2. 85  for  m=23 . 08,  and  0.  112  for 

m=0.  043.  The  SIE  displacement  data  predicts  es sentially  the  same  K values 

with  the  exception  of  the  embedded  tip  m=0.  043  value  which  is  10%  lower, 

consistent  with  the  displacement  deviation  mentioned  above.  A detailed 

discussion  of  the  results  of  the  bimaterial  crack  problem  will  be  reserved 

4 

for  a future  specialized  paper  . 

The  second  example  is  the  branch  crack  in  an  elastic  tension  strip. 

Figure  6a.  The  main  crack  emanates  from  the  free  edge  at  45  ° and  its 

projected  length  normal  to  the  tension  is  W/4.  W is  the  strip  width, 

and  3W  is  the  strip  length.  The  branch  normal  to  the  tension  has  length 

W/80.  There  are  two  singularities  in  this  problem  each  with  local  r^ 

displacement  distributions.  The  right  end  of  the  branch  has  the  usual 

crack  tip  singularity  with  X = 1/2,  while  the  angle  on  the  upper  face  of 

the  crack  is  a reentrant  corner  with  X = 0.  674.  These  conclusions  are 

5 

drawn  from  the  asymptotic  analysis  of  reference  . The  finite  element 
mesh  at  the  branch  is  shown  in  Figure  6b.  The  singularity  elements  were 
chosen  to  have  a radial  extent  5%  of  the  branch  length  and  an  angular  dimension 
of  22.5  ° 
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The  angular  variation  of  the  normalized  polar  stress  /p  about 

90  oo 

the  bend  singularity  is  given  in  Figure  7.  The  data  are  from  the  singularity 
element  midpoints.  The  stress  state  is  essentially  entirely  compressive 
with  peak  compression  equal  to  3.  1 p^at  0 = 125°.  This  suggests  that 
forking  would  not  occur  from  this  point.  At  the  right  end  of  the  branch 
the  stress  intensity  factors  K^,  were  deduced  from  the  singularity 

element  crack  face  nodal  displacements.  If  6 represents  the  relative 
opening  displacement  of  the  nodes  on  the  two  crack  faces  and  A the 
relative  sliding  displacement,  the  equations  used  to  determine  and 
for  this  plane  stress  example  were 


Notice  that  the  factor  /tt  is  not  used  in  these  definitions.  The  value  of 
Kj  was  found  to  be  4%  lower  than  the  value  for  a normal  to  the  tension 
unbranched  crack  with  length  (1.05)  W/4, 

Kr  = 1.49  ‘coV"  (1. 05)  W/4 

-2 

Kn  was  determined  to  be  negligible  in  relation  to  K^,  K^/K^  < 10  . An 

additional  problem  was  considered  which  had  the  above  geometry  altered  by 
extending  the  branch  length  to  W/40.  again  was  4%  lower  than  that  of 

the  projected  length  crack 

Kj  = 1.52  *2/(1.  10)  W/4 

and  K/K  < 10~4  . 

II  I 
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CONCLUSIONS 


The  solutions  to  the  crack  problems  are  judged  to  be  very  accurate. 
The  agreement  between  the  singular  integral  equation  and  finite  element 
results  for  the  bimaterial  problems  supports  this  conclusion.  Certainly 
no  standard  finite  element  formulation  can  be  expected  to  provide  reasonable 
solutions  to  problems  such  as  these.  The  formulation  proposed  here 
allows  routine  analysis  of  a class  of  singularity  problems  which  heretofore 
has  been  approached  only  with  elaborate  analytical  methods.  The  singular 
element  proposed  is  simple  to  implement  since  it  is  easily  programmed 
using  techniques  which  today  are  commonplace. 
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FIGURE  6a 


Strip  With  Branch  Crack 
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CRACK  TIP  FIELDS  IN  STEADY  CRACK  GROWTH  WITH  LINEAR  STRAIN  HARDENING 


John  C.  Amazigo 
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Rensselaer  Polytechnic  Institute,  Troy,  New  York  12181 

and 

John  W.  Hutchinson 

Division  of  Engineering  and  Applied  Physics 
Harvard  University,  Cambridge,  Massachusetts  02138 


SUMMARY 

Singular  stress  and  strain  fields  are  found  at  the  tip  of  a crack 
growing  steadily  and  quasi-statically  into  an  elastic-plastic  strain  hardening 
material.  The  material  is  characterized  by  J^  flow  theory  together  with  a 
bilinear  effective  stress-strain  curve.  Anti-plane  shear,  plane  stress  and 
plane  strain  are  each  considered.  Numerical  results  are  given  for  the  order 
of  the  singularity,  details  of  the  stress  and  strain-rate  fields,  and  the 
near-tip  regions  of  plastic  loading  and  elastic  unloading. 


This  paper  is  to  be  published  in  the  Journal  of  Mechanics  and  Physics 
of  Solids. 
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FINITE-DIFFERENCE  SOLUTION  OF  POISSON'SEQUATION 
IN  RECTANGLES  OT  ARBITRARY  PROPORTIONS 
J.  Barkley  Rosser 

Mathematics  Research  Center,  University  of  Wisconsin, 
Madison,  Wisconsin 


1.  Introduction. 


We  consider  the  problem  of  getting  an  approximation  of  reasonably 
good  accuracy  by  finite-dilference  methods  for  the  function  u(x,  y) 
which  satisfies  Poisson's  equation 


(1-D 


V u(x,  y)  = f(x,  y) 


inside  a rectangle  R,  and  satisfies  various  boundary  conditions  on 
the  boundary  of  R.  When  f(x,  y)  = 0,  (l.l)  reduces  to  Laplace's 

equation,  and  the  problem  is  appreciably  simpler. 

This  problem  has  been  much  studied.  A common  approach  is  to 
cover  R exactly  with  a mesh  or  grid  of  small  rectangles,  after  which 
one  can  replace  (l.l)  by  a finite-difference  approximation  involving 
values  of  u(x,  y)  at  the  grid  points.  One  then  tries  to  solve  this  finite- 
difference  analogue  of  (l . l)  to  a suitable  degree  of  accuracy.  In  order 
to  employ  this  approach  when  high  accuracy  is  required,  it  has  been 
necessary  to  require  that  the  ratio  of  the  sides  of  R must  be  rational 
since  use  of  high  order  methods  usually  requires  that  one  cover  R 
exactly  with  a grid  of  squares.  However,  the  conformal  transformation 
method  of  Papamichacl  and  Whiteman  [ 2]  will  lead  more  often  than  not 


The  author  wishes  to  acknowledge  the  sponsorship  of  the  United 
States  Army  under  Contract  No.  DAAG29-7  fS-C-002-1  and  of  the  Science 
Research  Council  under  grant  B/RG  4 1 2 1 at  Brunei  University. 


to  a rectangle  in  which  the  ratio  is  not  rational,  and  covering  with  a grid 
of  squares  is  not  possible.  Even  when  the  ratio  is  rational,  there  may 
be  difficulties.  Suppose,  from  some  engineering  problem,  one  is  confronted 
with  a rectangle  R of  base  six  and  five-eighths  and  height  five  and 
seven-eighths.  If  this  is  to  be  covered  exactly  with  squares,  there  must 
be  53N  squares  along  the  base  and  47N  squares  along  a vertical  side, 
where  N is  a positive  integer.  With  such  a covering,  many  popular 
methods  would  operate  at  less  than  maximum  efficiency. 

Accordingly,  we  will  propose  a method  of  getting  good  accuracy 
with  moderate  labor  for  rectangles  of  arbitrary  proportions. 

2.  Formulation  of  the  problem. 

By  rotation,  translation,  and  scaling,  as  needed,  we  can  take 
the  rectangle  R to  be  that  shown  in  Figure  1 . By  rotating  through 

o 

another  90  and  translating  and  scaling  again,  if  need  be,  we  can 
assure  that  a > it.  If  a = tt,  we  have  a square,  and  familiar  approaches 
suffice.  So  we  assume  a >v. 

We  consider  first  the  case  of  Dirichlet  boundary  conditions.  That 
is,  we  wish  to  approximate  the  function  u(x,  y)  which  is  continuous 
on  and  inside  R,  satisfies 

(2.1)  V2  u(x,  y)  = f(x,  y) 
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The  rectangle  R 


Figure  1 


inside  R,  and  on  the  sides  of  R satisfies  the  Dirichlet  boundary 


conditions 


(2.2) 

(2.3) 

(2.4) 
(2.  5) 


u(0,  y)  = gQ(y) 
u(ir,  y)  = g (y) 

TT 

u(x,  0)  = hQ(x) 
u(x,  a)  = h (x) 

3 


0 < y < a 
0 < y < a 


0 < x < tr 


0 < x < IT  . 


Because  we  seek  a u(x,  y)  which  is  continuous  on  R,  as  well 

as  inside,  we  are  thereby  assuming  that  g (y)  and  g (y)  are 

continuous  for  0 < y < a,  that  h (x)  and  h (x)  are  continuous 

O 6 
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for  0 < x < tt,  and  that 


(2.6) 

g0(°) 

= M01  ’ 

(2.7) 

g0(a) 

= ha(0>  . 

(2.8) 

g (o) 

= h (tt)  , 

IT 

o 

(2.9) 

g„(a) 

= h (tt)  . 

a 

If  there  should  be  discontinuities  in  the  boundary  conditions,  or 
their  derivatives,  this  would  induce  still  another  source  of  errors  in 
the  solutions,  besides  those  due  to  truncation  and  round  off.  See 
Rosser  [ 3].  "Jump"  discontinuities  can  be  "removed"  by  the  methods 
on  pp.  221-222  of  Milne  [4].  More  complicated  discontinuities  can 
sometimes  be  "removed",  but  one  cannot  count  on  doing  this.  For  the 
present  treatment,  we  assume  that  the  boundary  conditions  and  their 
low  order  derivatives  are  continuous.  This  includes  continuity  at  the 
corners,  as  exemplified  by  (2.  6)  through  (2.9).  Or,  if  we  replace  (2.  2) 
by 

ux(°,  y)  = jQ(y)  0 < y < a , 

then  continuity  of  the  first  derivatives  at  the  corners  would  require 

io(0)  - h'o(0) 

]0(a)  = h;<0)  • 

3.  Finite -difference  approximations. 

There  are  finite-difference  approximations  of  various  orders.  The 
higher  order  methods  of  solution,  involving  the  higher  order  approximations, 


5o 


can  be  used  effectively  only  when  the  function  f(x,  y)  which  appears 
in  (2.1)  has  suitable  high  order  smoothness;  that  is,  when  it  is  continuous 
and  has  continuous  derivatives  of  suitable  orders.  Thus  the  reader  must 
exercise  discrimination  in  choosing  which  order  method  to  use.  When 
they  can  be  used,  the  high  order  methods  permit  the  use  of  coarse 
meshes.  This  can  greatly  reduce  the  labor  of  computation. 

lor  difference  approximations  of  order  2,  one  can  use  mesh  elements 
which  are  rectangles,  rather  than  squares.  See  Hockney  [1].  In  this 
case,  there  would  be  no  trouble  if  the  ratio  of  the  sides  of  R were 
irrational.  For  difference  approximations  of  order  4,  one  can  also  use 
mesh  elements  which  are  rectangles.  See  Rosser  [ 5 ] . For  difference 
approximations  of  order  6,  it  appears  that  the  mesh  elements  have  to  be 
squares.  Details  are  presented  in  Rosser  [5].  If  f(x,  y)  in  (2.1)  is 
sufficiently  smooth,  this  permits  one  to  use  quite  a coarse  mesh,  greatly 
reducing  the  computational  labor.  However,  this  raises  the  question 
how  to  proceed  if  the  ratio  of  the  sides  of  R is  irrational. 

4 • 111 -proportioned  rectangles . 

We  take  h to  be  the  side  of  the  square  mesh  element.  We 
arrange  that  the  squares  can  be  fitted  along  the  base  of  R.  That  is, 
we  take  M to  be  a positive  integer,  and  define 
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Wo  take  N to  bo  the  integer  part  of  aM/rr;  in  symbols 


'aM_ 

(4.2) 

N = 

IT 

Then 
(4.  3) 
(4.  4) 


Nh  < a , 

(N  + 1)  h > a . 


(4.  5)  Nh  = a , 

then  we  can  fill  up  the  rectangle  R exactly  with  MN  squares  of 
side  h,  and  the  methods  of  Rosser  [ 5]  are  applicable.  So  we  are 
interested  only  in  the  case  Nh  < a.  We  could  assume  this,  but  it  is 
not  required  for  the  analysis  v/hich  follows.  If  we  should  have  (4.  5) 
holding,  then  some  of  the  steps  of  the  subsequent  analysis  would  be 
quite  trivial  but  not  incorrect  in  any  way. 

We  begin  by  defining 
(4.  6)  b = Nh 

(4.7)  c = a - b = a - Nh  . 

Wo  take  R,  to  be  the  rectangle  with  corners  (0,0),  (0,b),  (it,0),  and 
b 

(rr,  b),  and  take  Rc  to  be  the  rectangle  with  corners  (0,c),  (0,a), 

(ir,  c)  and  (tt,  a). 

We  choose  h (x)  to  be  a smooth  function  such  that 
b 

hb(0>  = g0(b) 

hb<">  = 9,0=)  • 


fc' 


i 


The  better  we  can  choose  h^(x)  to  approximate  u(x,  b);  the  more  we 
can  curtail  certain  computations  later.  With  the  limited  information 
available  at  this  stage,  we  content  ourselves  with  taking 

h,(x)  = h (x)  + (1  - - )(g  (b)  - h (0))  + ~ (g  (b)  - h (tt))  . 
d a no  a tt  tt  a 

We  take  u^x,  y)  to  be  the  function  which  is  continuous  on  and 

inside  R,  , satisfies  (2.1)  inside  R,  , and  on  the  sides  of  R, 
b’  b b 

satisfies  the  boundary  conditions 


(4.8) 

(4.9) 

(4.10) 

(4.11) 


Ub(°,  y)  = 90(y) 

U,  (tt,  y)  = g (y) 

D TT 

ub(x, 0)  = ho(x) 
ub(x,  b)  = hfa(x) 


0 < y < b 

0 < y < b 

0 < x < XT 

0 < x < XT 


We  take  u (x,  v)  to  the  function  which  is  continuous  on  and  inside 

R , satisfies  (2.1)  inside  R , and  on  the  sides  of  R satisfies  the 
c c c 

boundary  conditions 


(4.12) 

u J°,  y)  = g Jy) 

k/  o 

c 

< y < a 

(4.13) 

u (TT,y)  = g (y) 

C TT 

c 

< y < a 

(4.14) 

uc(x,c)  = ub(x,c) 

0 

< x _<  XT 

(4.15) 

u (x,  a)  = h (x) 

C a 

0 

< X < TT 

By  our  definition  of  h^(x),  we  see  that  ub(x>  y)  has  continuous 
boundary  conditions  around  the  rectangle  R^.  Then  it  follows  by  (4.14) 
that  the  same  holds  for  u^Jx,  y)  relative  to  the  rectangle  R^.  This 
is  why  in  (4.8)  through  (4.15)  we  can  use  < rather  than  <. 
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By  (4. 1)  and  (4.  6)  we  can  fill  up  the  rectangle  R.  • exactly  with 

b 

MN  squares  of  side  h.  Thus  we  can  use  the  9-point  difference 

approximation  of  Rosser  [ 5]  to  get  accurate  approximations  for  u (x,  y) 

inside  R^  at  the  grid  points  (mh,nh).  From  these,  we  can  get 

accurate  approximations  for  ujmh,c).  By  (4.14)  these  are  part  of 

the  boundary  values  for  u^(x,  y).  Thus  it  is  necessary  to  determine 

them  to  order  h . By  the  principle  of  the  maximum,  it  is  also  sufficient. 

For  a given  m,  the  point  (mh,c)  is  on  a vertical  grid  line.  Thus 

one  can  determine  u (mh,c)  to  order  h^  by  using  a high  order 

interpolation  formula  in  one  dimension  on  the  values  at  the  six  grid 

points  (mh,  0),  (mh,  h),  (mh,  2h),  (mh,  3h),  (mh,  4h),  and  (mh,  5h). 

By  (4.14),  this  gives  us  good  approximations  to  u^(x,  c)  at 

x = h,  2h,  . . . , (M  - l)h.  By  (4. 1)  and  (4.  7)  we  can  fill  up  the  rectangle 

R^  exactly  with  MN  squares  of  side  h.  Thus  we  can  use  the  9-point 

difference  approximation  of  Rosser  [ 5]  to  get  accurate  approximations 

for  u (x,  y)  inside  R at  the  grid  points  (mh,  c + nh).  Then  we 
c c 

can  get  accurate  approximations  for  u (mh,  b)  by  the  method  mentioned 

c 

earlier. 

We  define  R^  to  be  the  rectangle  which  is  the  intersection 

of  the  rectangles  R,  and  R . In  R,  , the  function  u (x,  y)  - u,  (x,  y) 

b c be  c b ’ 

is  harmonic.  Also,  it  is  zero  along  the  bottom  and  along  the  two 
vertical  sides.  So  on  and  inside  R.  we  have 
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(•).  16) 


“„(*■ y)  - ub(x' y) : - ar  ft 


sinh  r(b  - c) 


where 


(4.17) 


2 (■ 

a = ~ {u  (x,  b)  - u,  (x,  b)}sin  rx  dx 
r it  ^ c b 


Clearly  the  la^l  are  bounded  by 


(4.18) 


2 max  |u  (x,  b)  - u (x,  b)  | . 
0 <x  <tt  C 


We  recall  (see  (4.11))  that 

ufa(x,b)  = hfa(x)  . 

Presumably  u (xf  b)  is  fairly  close  to  u(x,  b).  If  also  we  were  lucky 
enough  to  choose  h^(x)  fairly  close  to  u(x,  b),  then  by  (4.18)  the 
a^  will  be  fairly  small.  This  will  save  computational  effort  later. 

On  and  inside  R define 


(4.19) 


oo 

, \ Y i sinh  r(a  " v) 

v(x,  y)  = a b TT  sin  rx  , 

’ r r sinh  ra 


b _ sinh  rc 
r sinh  r(b  - c) 


On  and  inside  R,  define 
b 


UU 

u(x’  y)  = ub(X’  y)  + V(X'  y)  + £.  “r 

r=l 


sin  rx  . 


Wo  see  that  u(x,  y)  is  continuous  on  and  inside  the  rectangle 


i •* 

a 


f 

4* 


R , satisfies  (2.1)  inside  R , and  on  three  sides  satisfies  the  boundary 
b b 

conditions  (4. 8),  (4.9),  and  (4.10).  By  (4.16),  we  see  that  on  and 

inside  R,  we  have 
be 

(4.22)  u(x,  y)  = ujx,  y)  + v(x,  y)  . 

We  use  (4.  22)  to  define  u(x,  y)  for  the  rest  of  the  rectangle  Rc- 
Then  u(x,  y)  is  continuous  on  and  inside  the  rectangle  Rc>  satisfies 
(2.1)  inside  R^,  and  on  three  sides  satisfies  the  boundary  conditions 
(4.12),  (4.13),  and  (4.15). 

Thus  we  see  that  u(x,  y)  is  exactly  the  function  u(x,  y)  that 
we  were  seeking  to  obtain. 

We  have  obtained  accurate  approximations  for  u^x,  y)  and 
u^(x,  y)  at  various  grid  points.  If  M is  of  reasonable  size,  then  c 
is  small,  since  0<c<h  by  (4.7),  (4.3),  and  (4.4).  As  a is 
greater  than  tt,  and  b = a - c by  (4.7),  we  see  that  the  series  on 
the  right  of  (4. 19)  is  rapidly  convergent  for  0 < y < a.  Also,  the  series 
appearing  on  the  right  of  (4.  21)  is  rapidly  convergent  for  small  y, 
certainly  for  0 < y < h.  If  in  addition  the  a^  are  all  quite  small  (see 
(4.18)),  then  very  few  terms  of  the  series  are  needed  to  get  high  accuracy. 
So,  using  the  known  approximations  for  u^(mh,nh),  we  can  get 
approximate  values  for  u(x,  y)  for  small  y by  (4.21).  Tor  all  other 
values  of  y,  we  can  use  the  known  approximations  for  u^mh,  c + nh) 
to  get  approximate  values  for  u(x,  y)  by  (4.22). 
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The  calculation  of  the  af  presents  no  problem.  Not  more  than 
four  or  five  will  be  required;  fewer  if  the  a^  are  all  small.  Observe 
that  the  values  of  u^x,  b)  are  given  by  ( -1 . 11) . Also,  we  had  got 
accurate  approximations  for  u^(mh,  b).  So  we  can  use  a numerical 
quadrature  formula  to  calculate  the  a^  by  (4.17). 

CAUTION.  If  r is  not  fairly  small  compared  to  N,  then  there 

will  be  fairly  few  abscissa  points  in  each  cycle  of  sin  rx  in  (4.17); 

in  such  case  the  usual  quadrature  formulas  are  not  trustworthy.  One 

can  get  twice,  or  four  times,  or  eight  times,  as  many  abscissa  points 

by  interpolating  to  get  approximations  for  u (x,  b)  at  the  additional 

abscissa  points  (recall  that  u^(x,  b)  is  given  by  (4.11)).  For  this 

interpolation  one  can  use  a high  order  one  dimensional  interpolation 

formula  on  the  values  u (0,b),  u (h,b),  u (2h,b),...  . 

c C o 

We  need  high  accuracy  for  only  the  first  one  or  two  of  the  ar> 
because  of  the  very  rapid  convergence  of  the  series  appearing  on  the 
right  of  (4.19)  and  (4.  21).  In  any  case,  one  should  increase  the  number 
of  abscissa  points,  as  needed,  to  the  point  where  one  can  use  a 
quadrature  formula  with  assurance.  Also,  by  a little  foresight  in  the 
choice  of  M,  one  can  arrange  that,  after  increasing  the  number  of 
abscissa  points  if  needed,  one  can  use  a high  order  quadrature  formula, 
like  Bode's  Rule,  for  example. 


r 


5 . Tests  for  accuracy . 


One  advantage  of  using  the  9 -point  difference  approximation  when 
one  can  exactly  fill  up  the  rectangle  with  squares  is  that  one  can  make 
a first  calculation,  for  less  than  a quarter  of  the  calculating  effort, 
with  squares  twice  as  large  on  a side,  and  then  repeat  with  the  smaller 
squares.  Because  the  error  is  of  the  order  of  h^,  one  can  get  an 
estimate  of  the  error. 

This  can  be  done  with  the  present  procedure  by  choosing  M 
divisible  by  2.  If  N is  not  divisible  by  2,  the  values  of  b and  c 
which  are  used  with  the  squares  of  side  2h  will  not  be  the  same  as 
those  which  are  used  with  the  squares  of  side  h.  However,  this 
does  not  matter. 

One  dividend  that  will  accrue  from  making  an  initial  calculation 
w’th  squares  of  side  2h  is  that  from  this  calculation  one  can  derive 
a very  good  approximation  to  take  for  h^(x).  Then,  for  the  calculation 
with  squares  of  side  h,  the  a^  will  be  very  small,  so  that  not  more 
than  two  or  three  of  them  will  be  needed. 

6.  Neumann  boundary  conditions. 

Suppose  we  have  the  same  rectangle  R,  and  impose  on  u(x,  y) 
the  same  conditions  as  before,  except  that  on  top  of  the  rectangle  R 
we  specify  values  to  be  taken  by  u^(x,  a).  That  is  we  replace  (2.5) 
by  the  Neumann  condition 

(6.1)  u^(x,  a)  = k^(x)  0 < x < ir  . 
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We  postpone  to  the  latter  part  of  the  section  a discussion  of  how 


* 


[ 


one  would  handle  this  in  the  case  in  which  a/n  is  rational,  so  that 

one  can  fill  up  R exactly  with  squares  of  side  h.  For  the  moment, 

let  us  assume  that  this  can  be  done,  and  explain  how  to  generalize  to 

the  case  in  which  a/n  is  irrational. 

We  proceed  very  nearly  as  in  Section  4.  Instead  of  the  definition 

given  there  of  h.  (x),  we  use 
b 

(6.  2)  h (x)  = (1  - ~)  g (b)  + - g (b)  . 

O TT  O IT  U 

We  take  u (x,  y)  as  before,  but  for  u (x,  y)  we  replace  (4.15)  by 

D C 

the  analogue  of  (6.1),  namely 

(6.  3)  u (x,  a)  = k (x)  0 < x < tr  . 

9y  c a 

Everything  now  goes  the  same,  down  to  the  definition  of  v(x,  y).  Let 
us  pause  a moment,  and  think  what  we  require  of  v(x,  y).  Clearly  it 
should  be  harmonic,  so  that  u(x,  y),  as  defined  in  part  by  (4.21)  and 
in  part  by  (4.22),  will  satisfy  (2.1)  inside  R.  Also,  we  wish  v(x,  y) 
to  be  zero  on  the  vertical  sides  of  R,  so  that  there  u(x,  y)  will 
satisfy  the  proper  boundary  conditions.  Also,  on  the  bottom  of  R,  we 
must  have 


(6-4) 


v(x,  0)=  l 
r=l 


sinh  rc 


r sinh  r(b  - c) 


sin  rx 


0 < x < IT 


so  that  by  (4. 21)  u(x,  y)  will  satisfy  the  right  boundary  conditions 
on  the  bottom  of  R.  Finally,  looking  at  (4.22),  we  see  that  if  u(x,  y) 
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is  to  satisfy  the  right  boundary  conditions  on  the  top  of  K,  we  must  have 


(6.6) 


v (x,  a)  = 0 

y 


0 < X < TT  . 


All  these  conditions  can  be  met  by  simply  replacing  the  factor 


sinh  r(a  - y) 
sinh  ra 


in  the  definition  of  v(x,  y)  by 

cosh  r(a  - y) 
cosh  ra 

In  this  case,  since  it  is  unlikely  that  (6.  2)  makes  hb(x)  come 

out  very  close- to  u(x,b),  we  cannot  count  on  the  af  being 
particularly  small,  so  that  two  or  three  more  of  them  might  have  to  be 

calculated.  It  might  be  better  to  turn  the  rectangle  R upside  down 

and  proceed  as  follows. 

Consider  next  the  case  in  which  the  Neumann  condition  is  at  the 
bottom  of  R.  That  is,  u(x,  y)  satisfies  (2.  2),  (2.3),  and  (2.  a), 


but  (2. 4)  is  replaced  by 


(6.  6) 


u (x, 0)  = k (x) 
y O 


0 < X < ir  . 


Again,  we  proceed  nearly  as  in  Section  4.  We  can  now  take 


h (x)  the  same  as  in  Section  4,  which  should  lead  to  smaller  values 

of  the  a,  so  that  we  can  get  by  with  calculating  fewer  of  them.  Tor 

the  definition  of  ub(x,y),  we  replace  (4.10)  by  the  analogue  of 
(6.6),  namely 


(6.7) 


u (x,  0)  - k (x) 

ay  b'  O 


0 < x < it  . 


We  take  u (x  y)  as  in  Section  4,  and  continue  the  same  down  to  the 
c 

definition  of  v(x,  y).  A key  requirement  is  that  u(x,  y),  as  defined 
by  (4.  21),  shall  satisfy  the  proper  boundary  conditions  at  the  bottom  of 
R.  In  Section  4,  this  required  that 


(6.8) 


v(x,  y)  + X a “ 

' ’ LJ,  r s i 
r = l 


sinh  r( 


inh  r(b  - c) 


should  be  zero  when  y = 0.  This  was  accomplished  by  the  proper 

choice  of  the  b . Now  we  must  assure  that  the  partial  derivative  of 

r 

(6.8)  with  respect  to  y shall  be  zero  when  y - 0.  Again,  this  is 
accomplished  by  the  proper  choice  of  the  bf;  specifically  we  now  take 


(6.9) 


- sinh  ra  cosh  rc 
r sinh  r(b  - c)  cosh  ra 


All  else  remains  the  same. 

Next  consider  the  case  in  which  there  are  Neumann  conditions 
both  at  the  top  and  the  bottom  of  R.  That  is,  u(x,  y)  satisfies  (2.  2) 
and  (2.  3),  but  (2.  4)  is  replaced  by  (6.  6)  and  (2.  5)  is  replaced  by  (6.1). 
We  proceed  much  as  in  Section  4.  In  the  definition  of  u^(x,  y)  we 
replace  (4. 10)  by  (6. 7),  and  in  the  definition  of  u^x,  y)  we  replace 
(4.15)  by  (6.  3).  We  define  h (x)  by  (6.2).  It  is  then  easily  verified 
that  we  should  replace 


sinh  ra 


in  the  definition  of  v(x,  y)  by 


cosn  r(a  - 
cosh  ra 
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and  define 


: 

,,  . cosh  ra  cosh  rc 

(o. 10)  b = —r  — : ——  . 

r sinh  r(b  - c)  sinh  ra 

One  can  of  course  have  Neumann  conditions  on  one  or  both  of  the 
vertical  sides.  Let  us  consider  first  the  case  in  which  there  are  Neumann 
conditions  on  both  vertical  sides,  but  Dirichlet  conditions  at  the  top  and 
bottom.  Rotation  by  90  would  reduce  this  to  the  case  just  considered. 
However,  this  is  not  desirable,  since  we  would  then  lose  the  qualifica- 
tion that  the  height  is  greater  than  the  base.  It  was  this  that  assured 
the  rapid  convergence  of  the  Fourier  series  in  (4.19)  and  (4.21). 

So  we  assume  that  (2.  4)  and  (2.  5)  hold,  but  that  (2.  2)  and  (2.  3) 
are  replaced  by 


(6.11) 

uy(°i  y)  = j (y) 

A O 

0 < y < a 

(6.12) 

u (tt,  y)  = j (y) 

X TT 

0 < y < a 

We  proceed  analogously  to  Section  4,  except  that  we  use  cosines 
instead  of  sines  throughout.  Because  it  is  desirable  to  have  u^(x,  y) 
continuous  around  the  boundary  we  define 

1 2 x2 

I (6.13)  h (x)  = h (x)  + — (x  - tr)  (h*  (0)  - j (b))  + — (j  (b)  - h'  (tt))  . 

•9  u a £tt  a o c tr  tt  a 

U We  define  u (x,y)  and  u^(x,  y)  as  in  Section  4,  except  that 

* they  now  have  Neumann  conditions  on  their  vertical  sides.  We  replace 

(4. 16)  and  (4.17)  by 

£ (6. 14)  uc(x,  y)  - ub(x,  y)  = 

t 

i 

E - 


» 


where 

1 

a.  = ~ 

0 ir 

TT 

(6.15) 

/ {u  (x,  b)  - ub(x,  b)}dx 

o c 

TT 

(6.16) 

a = 

r tt 

f {u  (x,  b)  - u,  (x,  b)}cos  rx  dx  . 

0 c k 

l *' 


\ 

r *■ 


) 1 


When  r = 0,  we  define 

sinh  r(y  - c)  _ y - c 
sinh  r(b  - c)  b - c 

Exactly  analogous  changes  are  made  in  (4.19)  and  (4.21). 

If,  in  addition  to  the  Neumann  conditions  on  the  vertical  sides, 
we  replace  one  or  both  of  the  Dirichlet  conditions  on  the  top  or  bottom 
by  Neumann  conditions,  we  can  modify  the  procedure  just  outlined 
quite  analogously  to  the  way  in  which  we  modified  the  procedure  of 
Section  4 earlier  in  this  section. 

It  v/ill  be  noted  that  we  are  allowing  the  possibility  of  Neumann 
conditions  on  all  four  sides.  For  this,  there  will  be  a solution  only  if 
the  boundary  conditions  satisfy  a certain  criterion.  If  they  do,  the 
solution  is  not  unique,  but  any  two  solutions  differ  by  a constant.  The 
procedure  outlined  v/ill  produce  one  of  this  infinity  of  solutions  if  and 
only  if  there  is  a solu.ion. 

To  handle  the  case  of  a Dirichlet  condition  on  the  left  side  and  a 
Neumann  condition  on  the  right  side,  we  replace  sin  rx  by 


with  suitable  related  changes.  To  handle  the  case  of  a'Dirichlet 
condition  on  the  right  side  and  a Neumann  condition  on  the  left  side, 
we  replace  sin  rx  by 


cos(r  - ~)x  . 


We  consider  finally  how  to  handle  the  case  in  which  the  rectangle 

has  a rational  ratio  of  the  sides,  and  we  have  filled  it  exactly  with 

squares  of  side  h,  and  wish  to  approximate  u(x,  y)  at  the  grid  points. 

At  interior  grid  points,  we  can  use  one  of  the  formulas  of  Rosser  [5]. 

On  boundaries  where  there  are  Dirichlet  boundary  conditions,  we  assign 

u the  specified  value.  This  leaves  only  the  boundary  points  where 
m,  n 

there  is  a Neumann  condition  to  be  dealt  with.  Suppose,  for  example, 
that  the  condition  (6.11)  holds  on  the  left  side  of  R.  We  note  that 

(6.17)  hfx(x’  y)  s f(x>  y) + 5f(x  + h»  y) _ 5f(x  + 2h>  y) 

+ y f(x  + 3h,  y)  - ~ f(x  + 4h,  y)  + ^ f(x  + 5h,  y) 

holds  to  within  terms  of  order  h^.  If  we  take  x = 0 and  y = nh, 

we  get  by  (6. 11) 

(6.18)  W0(nh)*-  lJVn  + 5“l,n 

10  - 5 - 1 - 

" 5U2,n  + 3 U3,n  ~ 4 U4,n  5 U5,n  * 

One  could  use  a higher  order  formula  than  (6.17),  but  it  probably 

suffices.  A heuristic  argument  for  this  is  as  follows.  By  the  principle 

6 

of  the  maximum,  if  we  wish  to  determine  interior  points  to  order  h , 


it  is  sufficient  to  determine  the  boundary  points  to  order  h . However, 
if  the  interior  points  arc  given  to  order  h , ( 6.18)  will  determine 

u„  to  order  h^. 

0,  n 

Use  of  (6.18)  with  the  formulas  of  Rosser  [ 5]  results  in  a rather 

messy  matrix  of  coefficients  of  the  u . However,  one  is  probably 

m,  n 

using  such  a coarse  mesh  that  this  matrix  would  be  less  than  100  x 100, 
perhaps  even  less  than  50  x 50.  If  so,  probably  the  quickest  method  of 
solution  is  to  use  the  standard  computer  routine  for  solving  simultaneous 
linear  equations.  If  this  is  done,  it  does  not  much  matter  if  the  matrix 
is  messy  or  not. 

If  it  happens  that  one  is  solving  the  Laplace  equation,  with 
f(x,  y)  = 0,  and  has  a zero  normal  derivative  along  one  side,  say 
J (y)  = 0,  one  can  use  the  reflection  principle  to  replace  (6.18)  by 
something  which  seems  conceptually  simpler.  However,  it  involves 
three  boundary  gnd  points  and  three  interior  points,  and  so  is  probably 
about  as  much  bother  on  a computer  as  (6.18),  which  also  involves  six 
grid  points. 

If  one  has  Neumann  conditions  on  one  or  more  sides,  and  so  is 
using  (6.18),  one  might  consider  the  following  procedure,  which  would 
bypass  the  treatment  in  Section  4 altogether.  Almost  always,  there  is 
at  least  one  side  with  Dirichlet  conditions.  By  rotating  and  relinquishing 
the  qualification  a > it,  if  need  be,  we  can  arrange  to  have  Dirichlet 
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conditions  on  top.  If,  in  the  notation  of  Section  4,  we  have  0 < c < h, 
the  difficulty  is  that  wo  have  no  good  way  to  write  down  an  equivalent 
of  (3.7)  of  Rosser  [5]  for  the  values  of  u(x,  y)  at  the  row  of  grid 
points  (mb,  Nh),  1 <m  < M - 1.  As  a substitute,  write  down  (3.7)  of 
Rosser  [5]  for  the  9-point  formula  centered  at  (mil,  a - h).  It  involves 
values  of  u(x,  y)  at  ((m  - l)h,  a - h),  ((m  - l)h,  a - 2h),  (mh,  a - h), 

(mh,  a - 2h),  ((m  + l)h,  a - h),  ((m  + l)h,  a - 2h),  as  well  as  at  the 
boundary  points  ((m  - l)h,  a),  (mh,  a),  and  ((m  + l)h,  a),  at  which 
latter  points  u(x,  y)  is  known.  Now,  by  a high  order  one  dimensional 
interpolation  formula,  we  can  write  each  of  u(rh,  a - h)  and  u(rh,  a - 2h), 
approximately  as  a linear  combination  of  u(rh,nh)  for  n < N;  we  do 
this  for  r = m - 1,  r - m,  and  r = m + 1.  So  we  get  a formula 
involving  u(rh,  Nh),  u(rh,(N  - l)h),  etc.,  for  r = m - 1,  m,  m + 1, 
which  we  can  use  in  place  of  (3.7)  of  Rosser  [5].  Probably  interpolation 
of  order  eight  should  be  used.  This  makes  the  matrix  still  messier,  but 
if  we  are  having  to  deal  with  a messy  matrix  anyhow,  because  of  the 
Neumann  conditions,  the  idea  might  be  worth  considering. 
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SOLUTIONS  TO  INITIAL  VALUE  PROBLEMS  USING 
FINITE  ELEMENTS  - UNCONSTRAINED  VARIATIONAL  FORMULATIONS 
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ABSTRACT.  This  paper  presents  a variational  formulation  which 
treats  initial  value  problems  and  boundary  problems  in  a unified 
manner.  The  basic  ingredients  of  this  theory  are  (1)  adjoint  variable 
and  (2)  unconstrained  variations.  It  is  an  extension  of  the  finite 
element-unconstrained  variational  formulation  used  previously  in  solving 
several  nonconservative  stability  problems.  The  technique  which  makes 
this  extension  possible  is  described.  This  formulation  thus  enables 
one  to  adapt  such  numerical  technique  as  the  finite  element  method, 
which  has  had  great  success  and  popularity  for  solution  of  boundary 
value  problems,  for  solutions  of  initial  value  problems  as  well.  These 
formulations  are  given  here  for  a forced  vibration  problem,  a heat 
(mass)  transfer  problem  and  a wave  propagation  problem.  Numerical 
calculations  in  conjunction  with  finite  elements  for  two  specific 
examples  are  obtained  and  compared  with  known  exact  solutions. 

1.  INTRODUCTION.  In  its  application  to  the  solutions  of  engineering 
problems,  the  finite  element  discretization  has  been  implemented  almost 
exclusively  to  the  spatial  dimensions.  For  dynamic  or  time-dependent 
problems  whose  solutions  as  functions  of  time  are  of  interest,  a step- 
by-step  procedure  of  finite  difference,  i.e.,  the  quasi-static  approach 
is  usually  employed.  The  answer  to  the  question  why  the  time  dimension 
has  not  been  treated  equally  with  the  spatial  variables  in  the  finite 
element  discretization  must  be  related,  in  part  at  least,  to  the 
development  of  variational  methods,  since  the  finite  element  procedure 
can  be  viewed  most  readily  as  an  extremizing  sequence  associated  with  a 
variational  statement.  While  there  are  numerous  variational  principles 
for  boundary  value  problems,  few  exist  for  initial  value  problems.  Like 
many  problems  involving  nonconservative  forces,  the  difficulty  appears  to 
be  that  initial  value  problems  are  nonself-adjoint  and  thus  they  do  not 
possess  variational  principles  in  the  classical  sense.  In  conjunction 
with  problems  involving  nonconservative  forces,  certain  constrained 
variational  principles  (sometimes  called  extended  Hamilton's  principles 
--See,  for  example,  ref.  [1])  were  used  for  finite  element  solution 
formulations  [2,  3].  Shortly  afterwards,  using  the  combined  notion  of 
the  Lagrange  multipliers  and  the  adjoint  variable,  some  unconstrained 
variational  statements  were  established  and  used  as  bases  for  finite 
element  solutions  [4,  5].  This  approach  has  been  shown  to  be  more 
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advantageous  in  terms  of  simplicity, versatility  and  the  rate  of  conver- 
gence compared  with  the  constrained  variational  approach  [5,  6]. 

Fried  was  first  to  treat  the  time-dimension  identically  with  the 
space  dimensions  in  using  the  finite  elements  [7].  His  solution 
formulations,  however,  emanate  from  constrained  variational  principles. 

In  contrast,  this  paper  presents  a generalization  of  the  unconstrained 
variational  approach  to  time-dependent  problems. 

At  this  point,  the  variational  principles  of  integrals  of  convo- 
lution developed  by  Gurtin  [8,  9]  should  be  mentioned.  The  applications 
of  these  principles  in  conjunction  with  finite  elements  in  the  time- 
dimension  [10,  11,  12,  13]  have  so  far  failed  to  show  any  advantage  over 
the  procedure  described  by  Fried.  In  fact,  all  these  analyses  had  to 
resort  to  either  the  Fried’s  or  some  other  similar  step-by-step  procedure 
to  complete  the  solutions  in  the  time-dimension. 

In  this  paper,  the  use  of  unconstrained  variational  principles  - 
finite  elements  for  usual  boundary  value  problems  is  first  illustrated 
and  the  advantages  over  the  constrained  formulations  are  pointed  out. 

The  unconstrained  variational  principles  can  always  be  constructed 
through  the  use  of  the  Lagrange  multipliers.  The  unconstrained  vari- 
ations are  then  shown  to  lead  naturally  to  (nonself-)  adjoint  variational 
statements.  Thus,  nonconservative  problems  can  be  formulated  easily 
using  finite  elements.  The  application  to  a control  problem  is  given 
[14].  With  the  introduction  of  a cross-product  term  involving  two-point 
boundary  (initial)  values,  the  unconstrained  variational  - finite  element 
formulation  is  again  easily  extended  to  include  time-dependent  problems. 
This  formulation  is  obviously  simpler  compared  with  those  derived  from 
Gurtin 's  variational  principles  because  no  convolutional  integrals  are 
needed.  It  is  also  easier  to  use  and  more  versatile  than  the  Fried's 
procedure  due  to  the  fact  that  no  boundary  or  initial  conditions  are 
involved  in  the  solution  formulation  and  because  of  the  nature  of  the 
Lagrange  multipliers.  As  further  examples  of  application,  finite 
element  matrix  equations  are  derived  for  several  transient  problems 
Including  a force  vibration,  a heat  transfer  and  a wave  propagation 
problem.  Detailed  formulations  and  numerical  results  of  two  examples 
are  given  and  comparisons  with  some  known  exact  solutions  are  made. 

2.  LAGRANGE  MULTIPLIER  AND  FINITE  ELEMENT  FORMULATIONS.  One  of 
the  advantages  of  the  finite  element  method  is  its  capability  of  solving 
large  complicated  problems  in  a routine  manner.  However,  the  same  con- 
cepts used  in  a program  for  large  systems  may  be  understood  using 
relatively  simple  problems. 

Let  us  consider  the  stability  of  a Euler's  column.  The  governing 
equations  are  as  follows: 


7C 


u"(4)  = 0 (Id) 

F I u"'  (£)  + P u'  (£)  =0  (le) 


where  u is  the  lateral  displacement,  a prime  (')  denotes  differentiation 
with  respect  to  the  coordinate  x;  E is  the  Young's  modulus,  p,  density 
of  the  material;  I is  the  second  moment.  A,  area  of  the  cross-section,  i, 
length  of  the  beam  and  tu  is  the  eigenvalue.  For  eqs.(l),  a usual  varia- 
tional principle  can  be  written: 

<5J ! (u)  = 0 (2a) 

where  . 

J1  C»)  = j J*  [E  I (u")2  - P(u')2  + u)2pAu2]  dx  (2b) 

To  establish  the  equivalence  between  eqs.  (1)  and  (2),  one  simply 
carries  out  the  variation  of  in  eq.  (2a): 

6J1  = Jq  [E  I u"<Su"  - P u'6u'  + u)2  pA  u 6u]  dx  (3a) 

= [E  I u""  + P u"  + a)2  pAu]  6u  dx 

+ [E  I u"  6u'  - (E  I u"'  + P u')  6u]x  _ £ 

- [E  I u"  6u 1 - (El  u"'  + P u')  6u]x  = 0 (3b) 

, From  eq.  (3b)  one  observes  that  for  the  coordinate  functions  and  their 

variations  satisfying  the  boundary  conditions  in  eqs.  (lb  - le),  eq. 

(la)  implies  eq.  (2a)  and  vice  versa.  The  finite  element  formulation 
for  this  problem  begins  with  eq.  (3a). 

| 

Let 

u(x)  = a (x)  U (4) 


where  a(x)  is  the  displacement-function  vector  and  U , the  generalized 
displacement  vector.  Upon  the  substitution  of  eq.  (4)  into  eq.  (3a), 
one  immediately  obtains 


Eq.  (5)  is  not  yet  ready  to  be  solved  since  neither  U nor  6U 
consists  of  independent  elements  due  to  the  boundary  conditions  re- 
quirements placed  on  u(x). 

Let  us  now  consider  a slightly  different  variational  principle: 


with 


6J2  = 0 


J2  = L j*  [E  I (u")2  - P (u')2  + to2  pAu2] 

+ iaj  [u(0)]2  ♦ I ci2  [u'(0)]2 

where  oij  and  a2  are  the  Lagrange  multipliers. 

Carrying  out  the  variation  of  eqs.  (7),  we  have 
% 

6J-,  = f [El  u"  6u"  - Pu'  6u  + w2pAu2]  dx 
i o 

+ ax  u (0)  6u(0)  + a2  u'  (0)  du'(0) 

= [E  I u"”  + Pu"  + a)2  pAu]  du  dx 
o 

+ [E  I u"  6u'  - (E  I u'"  + Pu')  <$u]x=J, 

- [ (E  I u"  - a,u'  - (El  u"'  + Pu'  + a^u)  6u] 


dx 


(7a) 


(7b) 


(8a) 


x = 0 


(8b) 


Eq.  (8b)  states  that  a necessary  and  sufficient  condition  for 
6 J2  = 0 is  the  problem  defined  by  the  following  sec  of  equations: 

E I u""  + Pu"  + io2pAu  = 0 (9a) 

E I u" (0)  - a2  u'  (0)  = 0 (90) 

E I u’"  (0)  + Pu'(0)  + ax  u(0)  = 0 (9c) 

E I u"(Jl)  = 0 (9d) 

E I u"'  (£)  + Pu'  (£)  = 0 (9e) 

provided  that  the  variation  6u  is  completely  arbitrary,  comparing 
eqs.  (9)  and  (1),  it  is  seen  that  eqs.  (1)  is  a special  case  of  (9)  as 
, a2  approach  to  infinity.  From  eq.  (8a),  we  can  see  that  the  finite 
element  matrix  equation  now  becomes 
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where 

K2  = Ki  + CtjaCOa^O)  + a2a' (0)a'T(0)  (11) 

The  matrix  £ in  eq.  (11)  has  been  defined  in  eq.  (5)  and  the  super- 
script T denotes  the  transpose  of  a matrix  (a  vector) . Since  6u  is 
arbitrary,  6(J  in  eq.  (10)  is  arbitrary,  eq.  (10)  leads  directly  to  the 
final  matrix  equation  to  be  solved. 

|jC2  + co2m|  U = 0 (12) 

It  is  then  clear  that  the  method  of  Lagrange  multipliers,  used  in 
conjunction  with  the  finite  element  method,  will  not  only  facilitate 
the  solution  formulations  but  also  encompass  a larger  class  of  problems 
to  be  solved  compared  with  the  use  of  constrained  variational  statements. 
The  applications  of  the  same  general  concept  can  be  extended  further. 

3.  FROM  UNCONSTRAINED  VARIATIONS  TO  ADJOINT  VARIATIONAL  STATEMENTS. 

We  have  noted  that  the  variation  6u  in  eq.  (8)  is  quite  independent 
of  the  function  u itself  and  nothing  will  be  changed  if  we  simply 
replace  6u  with  Sv  to  emphasize  this  independence.  This  substitution, 
however,  has  suggested  the  adjoint  variational  principles.  Let  us 
consider 


S 

; i 


\ 


l i 


% 
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<5J3  = 0 (13a) 

J3  = (El  u"v"  - Pu’v'  + w2pAuv)  dx 
+ aju(0)v(0)  + a2u'(0)v(0)  + a^Pu ' (£)v  (£.)  (13b) 

Carrying  out  the  variations,  we  have: 

«J3  = (6J3)u  * t«J3)y  <l4> 

where 

(6J,)  = / (El  u"6v"  - Pu'6v'  + w2pAn6v)  dx 

J u 0 

+ ctj  u(0)6v(0)  + o^u' (0) Sv' (0)  + a^u'  (£)  Sv(Jl)  (15a) 

,1 

= (El  u""  + Pu"  + w2pAu)  6v  dx 
+ (E  I u"6v'  - (E  I u"'  + Pu'  - a3u')  6v]x  _ 


r 


- [ (E  I u"  - a2u')  6v'  -(El  u"'  + Pu'  + a^u)  6v]x  _ Q (15b) 
and 

(6Jj)v  = J0  (El  v"  6u"  - Pv'6u'  + 0)2pAv6u)  dx 

♦ aj  v(0)  6u(0)  + a2  v'(0)  6u'  (0)  + a3  v(£)  6u'  (A) 

(E  I v""  + Pv"  + U)2pAv)  6u  dx 

♦ [(E  I v"  + a3v)  6u'  - (El  v'"  + P v')  6uJx  _ ^ 

- [(E  I v"  - ct9  v')  6u'  - (El  v"'  + Pv'  + ai  v)  6u] 

*•  x = 0 

From  eq.  (15a),  it  is  clear  that  a necessary  and  sufficient 
condition  for  (6J3)U  =0  is  the  problem  defined  by  the  following  set 
of  equations: 

D.E.  El  u""  + Pu"  + to2pAu  = 0 (17a) 

B.C.  E I u"(J,)  = 0 (17b) 


(16a) 


(16b) 


E I u"1  (£)  +(P  - ct3)  u’  (£)  = 0 


(17c) 


E I u"(0)  - a2  u’(0)  = 0 (17d) 

E I u"'  (0)  + P u'  (0)  + ax  u(0)  = 0 (17e) 

Now  eqs.(9)  has  become  a special  case  of  eqs.(17)  when  a3  = 0. 

In  addition,  the  problem  defined  by  (6J3)V  =0  of  eqs.  (16)  is  called 
the  adjoint  problem  to  eqs.  (17).  For  a3  = 0,  the  adjoint  problem  is 
identical  to  the  problem  itself  — hence,  the  self-adjoint  system.  Now, 
considering 

a3  = k P (18) 

in  eq.  (17c) , we  have 

E I u'"  (*)  -K  P u'  (£)  =0  (19) 

K = k - 1 (20) 
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Eq.  (19)  defines  the  boundary  condition  of  a general  non-conser- 
vative load.  It  is  also  clear  from  eq.  (19)  that  K is  a dimensionless 
design  constant  which  defines  the  small  angle  between  the  direction  of 
the  applied  load  P and  the  tangent  of  the  deflected  column  at  the  end. 
Since  (6J3)U  = 0 alone  defines  the  boundary  value  problem  of  eq,  (17) 
and  vice  versa,  we  need  not  at  all  to  be  concerned  with  the  adjoint 
problem.  Now  it  is  a simple  matter  to  modify  the  finite  element  matrix 
equation  as 

| K3  + w2M  J U = ° (22) 

where 

K3  = K2  + a3  a' (A)  aT(Jl)  (23) 


4.  FINITE  ELEMENTS  FOR  INITIAL  AND  INITIAL- BOUNDARY  VALUE  PROBLEMS. 

(1)  A Forced  Vibration  Problem.  Let  us  first  consider  a problem 
of  "one"  degree  of  freedom,  i.e.,  a mass-spring  system.  The  differential 
equation  and  initial  conditions  are 


mii  + k u = f(t),  0<t<T 

(24a) 

u(0)  = uQ 

(24b. 

u (0)  = ux 

(24c) 

where  u(t)  is  the  displacement  of  the  mass  centre  from  its  equilib- 
rium position,  m , the  amount  of  mass  and  k , the  spring  constant. 
The  function  f(t)  is  given,  so  are  the  constants  u0  and  u: . The  constant 
T appeared  in  the  bounds  of  eq.  (24a)  is  any  given  positive  number 
other  than  infinity.  In  order  to  formulate  approximate  solutions  for 
eqs.  (24)  the  way  we  did  in  the  previous  section,  let  us  consider  a more 
general  case 


m ii  + k u = f(t) 
u (T)  - ot  [ u(0)  - uQ  ] = 0 
u(0)  = Uj 


(25a) 

(25b) 

(25c) 


where  a is  a parameter,  obviously  eqs.  (25)  reduce  to  (24)  when 

a approaches  to  °°  . Now,  with  eqs.  (25),  we  are  able  to  write  an 

unconstrained  variational  statement  as  follows: 


6 J4  = 0 


(26a) 
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where 


J.  = f [ - muv  + kuv  - f(t)v  ] dt 
^ o 

+ ma  [u(0)  - Uq]  v(T)  - mu^  v(0)  (26b) 

Since 

,T  . . 

J4 )u  = J [ - mu6v  + ku6v  - f(t)  6v  ] dt 
+ ma  [ u(0)  - uQ  ] 6v(T)  - mu^fivCO)  (27a) 

= [ mu  + ku  - f(t)  ] 6v  dt 

- m | u(T)  - a [u(O)  - uQ]  j 6v(T) 

+ m [u(O)  - uj  ] 6v(0)  (27b) 

The  already  familiar  form  of  eqs.  (27)  state  that  (a),  (<5J)U  = 0 is  a 
necessary  and  sufficient  condition  for  eqs.  (25), and  (b),eq.  (27a) 
provides  us  the  finite  element  matrix  equation.  Thus,  if  we  assume  as 
before  that 


u(t)  = aT(t)  U 

v (t)  = aT(t)  V 

Eq.  (27a) 

yields 

<$vT  k4  u = 6vt  f 

(28) 

where 

j 

K4  = J0  ( -m  a aT  + k a aT)  dt 

+ ma  a(T)  aT(0) 

(29) 

and 

i *T5 
II 

T 

/ f(t)  a dt  + mau0  a(t)  + m u0  a(0) 

(30) 

Again,  since  SV  is  unconstrained  eq.  (28)  leads  directly  to 

K4  U = F (31) 

which  is  the  final  equation  to  be  solved. 
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(2)  A Heat  Conduction  Problem.  The  one  dimensional  transient 
heat  conduct  problem  can  be  described  by  the  equation 


9 9u  3u 

3jT  ^ 3x  )“  Pc  3tT  " *(*»*)  s 0 


(32a) 


Boundary  and  initial  conditions  are 


u(0,t)  = g0(t) 
u(L,t)  = g (t) 
u(x,0)  = h (x) 

K = thermal  conductivity 
p = material  density 
c = specific  heat 
f(x,t)  = heat  source  function 
gg(t),  gif*)  and  h(x)  are  prescribed  functions 


where 


Let  us  consider 


(32b) 

(32c) 

(32d) 


6J5  = 0 


(33a) 


- |L  JT  [K  PC  |H-v  + f(x,t)  v]  dt  dx 


3x  3x 


+ /0  aK  [“(LfO  " gi(t)]  v(L,t)  dt 

- J0T  aK  [u(0,t)  - g0(t)]  v (0,t)  dt 

- /flL  pc  [u(x,0)  - h(x)J  v (x,0)  dx 


(33b) 


(6J5)u  =jj\  |H-  6(|1)  + pc  |H.  6v  ♦ f(x,t)  6v]  dx  dt 


+ /„  a K [u(L,t)  - gj(t)]  6v(L,t)  dt 
/oT  a K [u (0,t)  - g0(t)]  6v(0,t)  dt 


i* 


lo  pc  [u (x,0)  - h(x)]  6v(x,0)  dx 


(34a) 


? i 3 iv  3u  * _ „„  3u 


- Ul  57  V 57’  - I*  It  - f(x't,]{v  dxdt 

- J0T  K | - a[u(0,t)-gl(t)]  }fiv(L,t)  dt 

♦ /oT  K | *2*0. t)  - a[u(0,t)  - gQ(t)]  j.6v(0,t)  dt 
L 

+ / pc  [u(x,0)  - h(x)  ] 6v(x,0)  dx  (34t 

o 

it  is  clear  that  (6J5)U  = 0 is  a necessary  and  sufficient  condition 
for  eqs  (32)  as  a and  eq.  (34a)  provides  the  finite  element  ma- 

trix equation.  We  can  write  from  eq.  (34a), 

- tfc  & ‘<S»  * *•  Svl  dxdt 

+ JT  aK  [u (L,t)  6v(L,t)  - u(0,t)  6v(0,t)]  dt 
L 

+ | pc  u(x,0)  6v(x,0)  dt 


= £ltf 6v  dxdt 
T 

+ J#  aK  [gx(t)  6v(L,t)  gQ(t)  6v(0,t)]  dt 
L 

+ / pc  h(x)  6v(x,0)  dx 
■’o 

Now,  let  T 

u(x,t)  = a1 (x,t)  U 

v (x , t ) = aT(x,t)  V 

in  the  usual  manner,  we  have 

6VT  K U = 6VT  F 

,L.T  T T . , 

K = - /J0CK  a>x  ajX  - p c a a>t)  dx  dt 

T 

+ / aK  [a(L,t)  a(L,t)  - a(0,t)  aT(0,t)]  dt 

L T 

+ f pc  a(x,0)  a (x,0)  dx 
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and 


LT 


F * ^£f(x,t)  a(x,t)  dxdt 
+ /,  oY.  [gj(t)  a(L,t)  - gQ(t)  a(0,t)]  dt 
rL 


+ J#  p c h(x)  a(x,0)  dx 

(39) 

Again,  since  6V  in  eq.  (37)  is  completely  arbitrary,  we 
final  matrix  equation  to  be  solved. 

arrive  at  the 

K U = F 

(40) 

(3)  A Wave  Propagation  Problem.  For  a quite  general 
gation  problem,  the  following  system  can  be  written. 

wave  propa- 

!%  - c*  = f (*,t) 

8xz  dt2 

(41a) 

u(0,t)  = gQ(t) 

(41b) 

u(L,t)  = gx(t) 

(41c) 

u(x,0)  = hQ(x) 

(4  Id) 

u(x,0)  = hx(x) 

(41e) 

The  extension  of  the  previous  formulation  to  this  problem  is  straight 
forward.  Let  us  consider 


6J6  = 0 


(42a) 


where 


.L.T 


j6-<rv  t - fe  £ * **  W -I  ^ 

T 

- a J0  [u(L,t)  - gj  (t)]  v(L,t)  dt 
,T 

+ a /o  [u(0,t)  - gQ(t)]  v(0,t)  dt 


a 


Again, 


rLfT  r 3u  3v 


(&j6)u  = /°  Jo  i ■ I7 6(  ^ + c*  If  6 ( It  ^ ■ f(x»t)fivi dxdt 

- a /T  [u(L,t)  - g,(t)]  <5v(L,t)  dt 
0 A 
,T 

+ a /0  [u (0,t)  - g0(t)]  fiv(0,t)  dt 

.1 

-a  [u(x,0)  - hQ(x)]  5v(x,T)  dx 
+ / [u(x,0)  - hj(x)]  6v(x,0)  dx  (4 


L T 

= /Jb  I ^“7  ~ c 2 ^-7  " f(x,t)]  5 v dxdt 

/0,0  2 -14. 2 


+ J { |£(L,t)  - a IU(L^)  - 81  Ct)]  } «v(L,t)  dt 

■ 4 { !£(0,t)  - a [“(O.t)  - g0(t)]  } 6v(0,t)  dt 

+ 4 { " a [u(x»°)  - ^ 00]  | 6v(x,T)  dx 

- / [ |j(x,0)  - h,(x)]  6v(x,0)  dx  (43b) 

V ot 

From  eqs.  (43),  it  is  again  clear  that  ( 6.T 5 D u = 0 is  a necessary  and 
sufficient  condition  for  eqs.  (41)  as  a -*■  00  and  that  eq.  (43a)  will 
yield  the  finite  element  matrix  equation.  From (43a)  one  has: 

u{-  H “S3*  «=*  W 6 ^ } dst 


- a / u(L,t)  6v(L,t)  dt  + a f u(o,t)  6v(0,t)  dt 
t t 

- a / u(x,0)  6v(x,T)  dx 
•V 

= //  f(x,t)  6v(x,t)  dx  dt 


ft 


a /t  5 v(L,t)  dt  + a /t  gQ(t)  6v(0,t)  dt 


- “ J 

h0(x)  5v(x,T)  dx  + £ hx(x)  6v(x,0)  dx 

(44) 

Again,  let 

u(x,t)  = aT(x,t)  U 

(45a) 

v(x,t)  = aT(x,t)  V 

(45b) 

Eq.  (44)  becomes , 

, in  matrix  form. 

T 

6 VT  K U = 6 V F 

(46) 

where 

T I 

e r T 9 • *T 

K = J0  J0  (-  a'  a'  + cz  a a1)  dxdt 

J 

-“4 

a(L,t)  aT(L,t)  dt  + a /J  a(0,t)  aT(0,t)  dt 

- a /„  a(x,t)  aT(x,0)  dt 

(47) 

T L 

F = /„  J0  f(x,t)  a(x,t)  dxdt 

- o J 

T 

fo  gx(t)  a(L,t)  dt  + a /0  gQ(t)  a(0,t)  dt 

+ a 

,L  »L 

4 Vx)  dt  + 4 bjW  dt 

(48) 

Due  to  the  arbitrariness  of  6V  , eq.  (46)  leads  directly  to  the  final 
matrix  equation 


KU=F  (49) 

5.  NUMERICAL  DEMONSTRATIONS.  Several  numerical  examples  will  be 
given  in  this  section  to  demonstrate  the  application  of  the  formulation 
described  so  far. 

(1)  Forced  Vibration.  We  shall  consider  a special  case  of  the 
forced  vibration  problem  formulated  earlier.  The  forcing  function  in 
eqs.  (24)  is  taken  to  be  a cosine  function  thus,  rewrite  eqs.  (24), 

mii  + ku  = fo  cos  u^t  (SOa) 
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u(0)  = u0 
u(0)  = u j 


(50b) 

(50c) 


where  Uq,  Uj,  f q and  o>£  are  given  constants.  In  the  finite  element 
formulation,  we  shall  replace  eqs.  (50)  with  the  following  set 


m ii  + k u = fp  cos  o)£t 
u(t)  - a [u(0)  - u0]  = 0 
u(0)  - Uj  = 0 


(51a) 

(51b) 


thus,  eqs.  (50)  becomes  a special  case  of  (51)  as  a -*■  °°.  It  is  con- 
venient to  nondimensionalize  the  independent  variable  t and  let 


T = t/T 


In  terms  of  t,  eqs.  (51)  become 


where 


••  2 2 

ii  + T <u  u = fj  cos  (T  u)f  t) 
u(l)  - T a [u (0)  - uQ]  = 0 
u(0)  - T uj  = 0 
fj  = T fQ/m  co2  = k/m 


(53a) 

(53b) 

(53c) 


The  exact  solution  for  eqs.  (53)  can  be  easily  written  as 
u(t)  = A cos  (T  id  1 t)  t B sin  (T  w • t) 
+ q cos  (T  u)f  • t) 

with 

n - f°  r Ul 

m (uj2-  u^2)  “ 


a uq  + T Uj  cos  (To))  - q [a  + Ta>£  sin  (Tuif)] 

A = (56) 

a + T a)  sin  (T  to) 

To  solve  eqs.  (53)  using  finite  elements,  one  begins  with  the  variational 
statement: 


6 J = 0 

«J  = / [-  u v + T2  w2uv  - f (t)  v]  dx 


(57a) 


J 


Now  that 


(57b) 


* : 


+ T a [u(0)  - Uq]  v(1)  - T uj  v(0) 


(&J)u  = 0 

1 

= J0  [-  u 6 v + T2  u)2  u6v  - f(x)6  v] 
+ T a [u(0)  - Uq]  v(1)  - TujSvCO) 
= So  [ii  + T2  to2  u - f(x)]  6v  dt 
- {u(0)  - a T[u(0)  - u0]}  6v(  1) 

+ { u(0)  - T ux>  Sv (0) 


(58a) 


(58b) 


(58c) 


From  eq.  (58b),  one  has 

1 

/„  [-  u 6 v + T2u2  u 6 v]  dt  + aT  u(0)  6v(l) 
1 


f(x)  Sv  dx  + a T un  6v(l)  + T Ui  6v(0) 

(59) 

with 

V 

u(x)  = aT(x)  U 

(60) 

v(x)  = aT(x)  V 

eq.  (59) 

leads  to 

<S  VT  K U = 6 VT  F 

or 

K U = F 

(61) 

where 

K = / (-  g aT  + T2  w2  a aT)  dt 

+ a T a(l)  a (0) 

(62) 

1 

F 

= j f(x)  a dx  + aTu0  a(l)  + T Uj  a(0) 

(63) 

The  results  obtained  from  this  finite  element  formulation  are  com- 
pared with  the  exact  solutions  as  shown  in  Tables  1-3.  The  values  of 
the  parameters  chosen  for  these  data  are  k = 1.0,  m = 1.0,  fg  = 1.0, 
u)£  = 0.5,  uq  = 1.0,  Uq  = 1.0  the  number  of  elements  used  is  ten.  The 
calculated  u and  u for  T = 2.0,  10.0  and  20.0  are  given  in  Table  1, 

2,  3 and  4 respectively.  The  forcing  function  cos  Uft  and  the  solu- 
tion u(t)  are  also  plotted  in  the  range  0 < t < 20  as  shown  in  Figure  1. 

(2)  Solutions  to  a Transient  Heat  Conduction  Problem.  As  another 
numerical  example,  we  shall  take  the  nondimensional  heat  transfer  problem 
defined  by  the  following  set: 


1 


1 


TABLE  1 


r ’ 

Solutions  to  the  Forced  Vibration  Problem  Using  FE-UVF 
Compared  with  Exact  Solutions  (in  Parentheses) 

0 < t < 2.0 


1 

t 

u(t) 

Exact 

u(t) 

Exact 

tj 

n 

1.000 

000 

0 

(1.000 

000 

0) 

1.000 

00 

(1.000  00) 

Pi 

1.198 

652 

6 

(1.198 

652 

7) 

0.979 

74 

(0.979  73) 

I' 

1.389 

153 

7 

(1.389 

153 

4) 

0.918 

43 

(0.918  42) 

1.563 

313 

2 

(1.563 

312 

6) 

0.816 

55 

(0.816  54) 

0.8 

1.713 

202 

9 

(1.713 

201 

8) 

0.676 

22 

(0.676  21) 

r A 

1.0 

1.831 

481 

7 

(1.831 

480 

3) 

0.501 

18 

(0.501  18) 

B 

1.911 

702 

4 

(1.911 

700 

6) 

0.296 

62 

(0.296  61) 

t 

B 

1.948 

585 

6 

(1.948 

583 

6) 

0.068 

98 

(0.068  97) 

1.938 

251 

2 

(1.938 

249 

1) 

- 0.174 

24 

(-0.174  25) 

1.8 

1.878 

396 

9 

(1.878 

395 

0) 

- 0.424 

82 

(-0.424  80) 

. \ 

2.0 

1.768 

416 

1 

(1.768 

416 

1) 

- 0.674 

13 

(-0.674  03) 

TABLE  3 

Solution  to  the  Forced  Vibration  Problem  Using  FE-UVF 
Compared  with  Exact  Solutions  (in  Parentheses) 


0 < t < 20.0 


3 


D.E.: 

fLH.  - = o , 0<x<l  ; 0<t<T 

9x2  9t 

(64a) 

B.C. : 

u(0),t)  - l,  2“  (l.t)  = o 

3x 

(64b, c) 

I.C.: 

u(x,0)  = 0 

(64d) 

where  T is  any  given  finite  real  positive  number.  To  facilitate  compu- 
tation, it  is  desirable  to  change  the  independent  variable  t into  x 
such  that 

x = t/T  (65) 

thus,  the  system  of 

eqs.  (64)  becomes 

D.E. : 

if"  - 12"  = o , 0<x<l  ; 0<T<1 

9x2  T 3x 

(66a) 

B.C.  : 

u (0 , t)  = 1 ; g-  (1 , t)  = 0 

(66b, c) 

I.C.: 

u(x,0)  = 0 

(66d) 

According  to  our  unconstrained  variational  formulation,  this 
again  replaced  by  the  following: 

system  is 

D.E.  : 

^7  - \ ~ = 0 , 0<x<l  ; 0<t<1 
9x2  T gT 

(67a) 

B.C. : 

(0,t)  + a [u(0,t)  -1]  = 0 

(67b) 

£ (i.-o  * o 

(67c) 

I.C.: 

u(x,0)  = 0 

(67d) 

Clearly,  eqs.  (67)  reduces  to  (66)  as  a -»■  “ . The  variational  state- 
ment can  be  written  as 

6J  = 0 (68a) 

Where  11  1-5 

t _ r t , 3u  3v  . 1 3u 


J = - / J (££  + ii 1 v)  dxdt 


9x  9x  T 9x 


+ a/Q  [u (0,x)  - 1]  v (0,t)  dx 
1 

+ L U(x,0)  v(x,0)  dx 


(68b) 


Due  to  the  fact  that  v(x,t)  is  unconstrained,  it  is  a simple  matter  to 
show  that 

(6J)U  = 0 (69) 


Solu 
Les  Si 


1.000  0.754  0.583  0.370  0.264  0.228 

(1.000)  (0.757)  (0.496)  (0.405)  (0.284)  (0.179) 


1.000 


1.000 


1.000 


0.855 


0.713 


0.622 


0.552 


0.910 


0.828 


0.767 


0.725 


0.945 


0.896 


0.857 


0.832 


0.516 


(1.000)  (0.853)  (0.721)  (0.616)  (0.549)  (0.526 


0.708 


(1.000)  (0.910)  (0.830)  (0.767)  (0.724)  (0.710) 


0.823 


(1.000)  (0.945)  (0.896)  (0.857)  (0.832)  (0.823) 


1.000 


0.967 


0.937 


0.913 


0.897 


0.892 


(1.000)  (0.967)  (0.937)  (0.913)  (0.897)  (0.892) 


TABLE  5 


Transient  Heat  Transfer  Solutions  u(x,t)  Using  FE-UVT 
Compared  with  Exact  Series  Solutions  (in  Parentheses) 

0 < t < T - 0.05 


Si 

0 

0.2 

0.4 

0.6 

0.8 

0.01 

1.000 

0.014 

0.002 

0.000 

0.000 

(1.000) 

1 

(0.005) 

(0.000) 

(0.000) 

(0.000) 

0.02 

1.000 

0.315 

0.047 

(0.003) 

(0.000) 

(0.000 

(1.000) 

(0.317) 

(0.046) 

(0.003) 

(0.000) 

(0.000) 

0.03 

1.000 

0.413 

0.103 

0.015 

0.001 

0.000 

(1.000) 

(0.414) 

(0.102) 

(0.014) 

(0.001) 

(0.000) 

0.04 

1.000 

0.479 

0.157 

0.034 

0.005 

0.001 

(1.000) 

(0.480) 

(0.157) 

(0.034) 

(0.005) 

(0.001) 

0.05 

1.000 

0.527 

0.206 

0.058 

0.012 

0.003 

(1.000) 

(0.527) 

(0.206) 

(0.058) 

(0.012) 

(0.003) 

\ 

«'* 

; » i 

i ** 

, i 


I * 


is  a necessary  and  sufficient  condition  of  eqs.  (67).  Now  the  finite 
element  matrix  equations  can  be  obtained  from  eq.  (69). 

cfij)u  = - /V  [—«(—) + 1 — w dxdt 

u <>  o 3x  3x  T 3x 
1 

+ / [u (0 , t)  - 1]  6v(0,x)  dt 
1 

+ / u(x,0)  <Sv(x,0)  dx  = 0 (70) 

0 

or, 

- /V  [!“  6(3v3  + I 3“  5v]  dxdt 
Vo  l3x  3x  T 3t 

,1  * 

+ a { U(0,T)  6v(0,t)  dt  + J u(x,0)  Sv(x,0)  dx 

0 0 

= a / 6v(0,x)  dx  (71) 

Using  the  usual  procedure  of  discretization  and  the  assumption 
of  displacement  functions,  the  final  finite  element  matrix  equation 
evidently  can  be  derived  from  eq.  (71).  We  shall  omit  the  details  here. 
The  computational  results  are  presented  in  Tables  4 and  5.  The  finite 
element  grid  scheme  used  is  shown  in  Figure  2.  As  clearly  shown  in 
those  tables,  excellent  agreement  exists  between  the  FE-UVF  approach 
and  the  series  solution.  It  is  noted  that  the  approximate  solutions 
are  less  accurate  invariably  as  they  approach  the  initial  time  t = 0. 
This  is  probably  due  to  the  discontinuity  of  the  initial  boundary  data 
at  x = 0,  t = 0. 
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BY  MATHEMATICAL  PROGRAMMING 
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1.  INTRODUCTION 

This  paper  is  concerned  with  the  numerical  solution 
of  free  boundary  problems  by  mathematical  programming. 

In  such  problems,  one  seeks  the  solution  of  a partial 
differential  equation  (usually  Laplace's  or  Reynolds' 
equation)  satisfying  prescribed  conditions  on  the 
boundary  of  a region  when  a portion  of  the  boundary  is 
unknown  and  must  be  determined  as  part  of  the  problem. 
The  unknown  boundary  is  called  the  free  boundary. 

Many  of  these  boundary  value  problems  have  not 
yielded  to  analytical  methods  of  solution.  Recently, 
however,  a novel  transformational  approach  has  met 
with  more  success.  Specifically,  the  free  boundary 
problem  is  reformulated  as  a variational  inequality 
which,  in  turn,  is  equivalent  to  a certain  constrained 
minimization  problem  in  a Sobolev  (function)  space. 
Although  this  latter  problem  is  still  computationally 
intractable,  finite  difference  or  finite  element 
approximations  yield  a difficult,  but  solvable,  sparse, 
specially-structured  quadratic  programming  problem  of 
potentially  very  large  size.  It  is  the  solution  of 
this  last  problem  with  which  we  are  concerned  and  for 
which  an  algorithm  will  be  stated. 

2.  APPLICATIONS 

Free  boundary  problems  arise  in  a variety  of 
situations.  Rohde  and  McAllister  [8]  have  developed 
the  variational  inequalities  for  the  finite-length 
journal  bearing  problem,  in  which  one  is  concerned 
with  a cylindrical  rod  (the  journal)  rotating  within  a 
tube  (the  bearing) . The  inner  surface  of  the  bearing 
is  coated  with  a thin  film  of  lubricant  and  we  wish 
to  know  the  pressure  distribution  on  the  film.  At  a 
certain  point,  the  pressure  becomes  so  low  that  the 
lubricant  vaporizes,  thus  creating  the  free  boundary 
interface. 
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In  the  area  of  fluid  dynamics,  Baiocchi  et  al.  [1] 
have  reformulated  certain  problems  dealing  with 
stationary  fluid  flow  through  porous  media  as  varia- 
tional inequalities.  These  include  porous  dams  in 
which  the  free  boundary  is  the  interface  between  the 
wet  and  dry  part  of  the  dam.  Brezis  and  Stampacchia 
[2] , [3]  have  studied  the  determination  of  steady  sub- 
sonic flows  for  nonviscous  compressible  and  incompress- 
ible fluids  past  a two-dimensional  convex  body  by 
using  a hodograph  transformation  to  obtain  an  equiva- 
lent free  boundary  problem  for  which  a variational 
inequality  problem  can  be  stated. 

3.  THE  QUADRATIC  PROGRAMMING  PROBLEM 


The  common  denominator  of  these  and  several  other 
free  boundary  problems  is  that  their  associated 
quadratic  programming  problem 

Minimize  f (x)  = j<x,Mx>  + <q,x> 

subject  to  x > 0 

has  certain  special  attributes  which  can  be  exploited  in 
the  development  of  efficient  algorithms.  The  matrix  M 
is  a block-tridiagonal  Stieltjes  matrix  (ie.,  symmetric, 
diagonally  dominant  with  nonpositive  off-diagonal 
entries) . Furthermore,  the  diagonal  blocks  are  them- 
selves tridiagonal  matrices  and  the  off-diagonal  blocks 
are  diagonal  matrices. 

One  computationally  successful  .approach  to  this 
problem  is  a modification  of  the  block-  (or  line-) 
successive  overrelaxation  method.  This  algorithm 

requires  that  we  partition  the  vector 

n | 

x = (x1,x2, . . . ,xm)  where  x^  e R and  conformably 

partition  M and  q.  For  this  special  class  of  problems, 
we  may  state  the  algorithm  as  follows: 


Algorithm 


Step  0.  Let  x = 


, 0 0 0.  . 

(x, , x_,  . ..,  x ) be  any  nonnegative 
1 z m 


vector,  eg.,  x =0.  Let  we (0,2)  be  given.  Set  k=0  and 


~k+l 

Step  1.  Determine  x^  ^ 0 which  minimizes  (over  the 

niv 

nonnegative  orthant  R ) 

e . k+1  k+1  k+1  „ k k. 

f (x1  , X2  » • • • » v , • • • , x^) 

1 k+1  k 

= -r-  <V,M.  .v>  + < (q . + M.  . x.  . + M.  ...  X..-),  v>+c. 

2 ll  ^l  i,i-l  l-l  1,1+1  1+1  l 


where  may  be  taken  to  be  zero. 


Step  2.  Define 


uk+1  = max  {00  : a)  < a),  x^  + a>(x^+^  - x*)  > 0 
1 — 1 1 1 — 

k+1  k . k+l.-k+l  k. 

X.  = x±  + (x±  - X.)  . 

Step  3.  If  i=m,  go  to  Step  4.  Otherwise,  return  to  Step  1 


with  i replaced  by  i+1. 


Step  4.  Define 


m 

S = { (i,  j)  : (x*+1)  j>0}U{(i,j)  : (x*+1)  ^=0,  (q^f  Mi£x*+1)  <0} 

m 

If  max  | (q.  + X M. 0x^+^) .|  < e,  stop.  An  approximate 
(i,  j ) eS  1 £=l  1)6  * ^ “ 

solution  is  at  hand.  If  not,  return  to  Step  1 with  k 

replaced  by  k+1  and  i=l. 

Step  1 requires  that  we  solve  a smaller  quadratic 
programming  problem  whose  quadratic  form  contains  a 
tridiagonal  Stieltjes  matrix.  For  a discussion  of  some 
fast  methods  to  do  this,  we  refer  the  reader  to  [5].  For 
more  details  on  the  development  of  and  computational 
experience  with  the  algorithm  given  above,  see  [4] . 

From  a consideration  of  storage  requirements  and  speed, 
one  may  conclude  that  this  algorithm  is  competitive,  if 
not  superior,  to  other  methods  described  in  the  literature. 


4 . REFERENCES 


[1]  C.  BAIOCCHI , V.  COMINCIOLI,  L.  GUERRI  and  G.  VOLPI , 

"Free  boundary  problems  in  the  theory  of  fluid 
flow  through  porous  media  : A numerical  approach", 
Calcolo  10  (1973),  1-86. 

[2]  H.  BREZIS,  "A  new  method  in  the  study  of  subsonic 

flows",  Lecture  Notes  in  Mathematics  No.  446 
(J.  Goldstein,  ed.)  Springer-Verlag , New  York, 

1975. 

[3]  H.  BREZIS  and  G.  STAMPACCHIA,  "The  hodograph  method 

in  fluid-dynamics  in  the  light  of  variational 
inequalities",  to  appear  in  J.  Arch.  Rat.  Mech. 

Anal. , 1976. 

[4]  R.  W.  COTTLE,  G.H.  GOLUB  and  R.S.  SACHER,  "On  the 

solution  of  large,  structured  linear  complementarity 
problems  : The  block-tridiagonal  case".  To  appear. 

[5]  R.W.  COTTLE  and  R.S.  SACHER,  "On  the  solution  of 

large,  structured  linear  complementarity  problems  : 
The  tridiagonal  case".  To  appear. 

[6]  C.W.  CRYER,  "The  method  of  Christopherson  for 

solving  free  boundary  problems  for  infinite 
journal  bearings  by  means  of  finite  differences". 
Math,  of  Comp.  2jj  (1971),  435-443. 

[7]  C.W.  CRYER,  "The  solution  of  a quadratic  programming 

problem  using  systematic  overrelaxation",  J.  SIAM 
Control  9 (1971),  385-392. 

[8]  S.M.  ROHDE  and  G.T.  McALLISTER,  "A  variational 

formulation  for  a class  of  free  boundary  problems 
arising  in  hydrodynamic  lubrication",  Int.  J. 

Eng.  Sci  13  (1975),  841-850. 


A NUMERICAL  INTEGRATION  ERROR  ANALYSIS 
UTILIZING  A WRONSKIAN  TECHNIQUE 


Larry  A.  Whatley 

Quality  and  Reliability  Division 
USAMERADCCM 

AITN:DRXFD-TQ  (Whatley) 

Ft.  Belvoir,  Virginia  22060 
Formerly,  Intern  Training  Center,  DARGQM 


S.  Bart  Childs,  Ph.D. 


Department  of  Industrial  Engineering 
Texas  ASM  University 
Texarkana,  Texas 


ABSTRACT.  An  error  analysis  is  performed  upon  the  superposition  and  numerical 
integration  procedures  of  a method  of  solution  of  multipoint  boundary  value  problems 
utilizing  power  series  expansions . The  procedure  involves  the  evaluation  of  the  re- 
lative error  of  the  Wronskian,  which  provides  a scalar  function  characterization  of 
the  error  of  integrators  of  a matrix  of  solutions.  The  error  behavior  is  investigated 
by  using  different  integration  step  sizes  and  orders  (terms  of  the  power  series) . 

Evaluations  are  performed  with  numerical  solutions  of  specified  accuracy  or 
order.  Example  applications  are  included. 

1.  INTRODUCTION.  Numerical  integration  is  ccmronly  used  by  engineers  and 
scientists  as  a tool  for  solving  ordinary  differential  equations.  Those  equations 
which  cannot  be  solved  exactly  or  in  closed  form  can  often  be  solved  using  numerical 
integration  techniques.  There  are  drawbacks  to  each  particular  integration  scheme. 

The  most  important  considerations  are:  the  origin  of  the  problem,  guidelines  from 

the  theory  of  the  algorithm,  the  computer  being  used,  and  the  class  or  problems  to 
be  considered,  Shampine  and  Allen  (1973) . 

Many  techniques,  in  the  form  of  "canned"  routines  or  pre-progr aimed  methods, 
and  their  variations,  are  available  to  the  user.  It  is  now  possible  to  obtain  numeri- 
cal solutions  using  techniques  which  require  lengthy  operations.  The  more  popular 
integration  techniques  (i.e.  Adams  methods,  Runge-Kutta,  etc.)  provide  reasonable 
results  for  a wide  range  of  applications.  They  are  subject  to  seme  disadvantages,  the 
most  cannon  being  their  susceptability  to  round-off  error,  Ralston  and  Wilf  (I960) . 
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An  alternate  method  of  numerical  integration  has  been  investigated  which  is 
based  upon  the  expansion  of  power  series.  The  method  is  relatively  free  of  tie 
disadvantages  of  the  more  popular  techniques  and  significantly  more  efficient  for 
certain  classes  of  problans,  Doiron  (1967) . Research  done  previously  by  Fehlberg 
(1964)  has  shown  the  power  series  technique  to  be  five  to  six  times  faster  than  a 
Runge-Kutta  method  for  the  same  specified  accuracy  in  certain  selected  problems. 

The  power  series  methods  generally  require  more  user  effort. 

The  purpose  of  this  study  is  to  investigate  the  error  in  the  integration  via 
power  series  expansions.  It  has  been  shown  that  tie  Wronskian  can  be  used  as  a 
meaningful  check  on  the  solvability  and  superposition  procedures  in  the  solution 
of  boundary  value  problems.  It  has  been  proposed  that  the  relative  error  of  the 
Wronskian  can  provide  some  insight  into  the  errors  arising  from  this  particular 
integration  scheme,  Childs  et  al.  (1971). 

2.  DEVEIiOPMENr . The  problems  to  be  considered  are  presented  as  an  ordinary 
differential  equation  written  in  the  general  linear  form  as 

V = + f (2-1) 

where  L is  a linear  operator  in  the  form  of  an  n x n coefficient  matrix  (expressed 
as  a constant  or  function  of  an  independent  variable) . The  letter  y represents  the 
state  variable  vector  and  y denotes  the  derivative  of  y with  respect  to  the  indepen- 
dent variable  (in  this  case  t) . The  vector  f is  a vector  of  forcing  functions.  The 
above  equation  is  subject  to  a set  of  specified  boundary  conditions. 

qi(y(ti))=bi  0 <_ti  <T  i = 1,2, . . .,m  (2-2) 

where  m >_n.  The  operator  is  a linear  combination  of  the  elements  of  the  vectors 
at  t = t . , that  is  equal  to  the  boundary  value  b ■ . 

V 1, 

To  meet  the  above  boundary  condition  it  is  necessary  to  superimpose  independent 
solutions  of  equation  (2-1) . The  technique  used  is  to  superimpose  the  appropriate 
number  of  solutions  of  the  homogeneous  equation 

H = LH  (2-3) 

upon  a particular  solution 

P * Lp  + f . (2-4) 

This  can  be  written  as 


y=p  + H8=p  + l h(k>  3,  r <_  n (2-5) 

k=l  K 

where  H is  a matrix  whose  columns  are  homogeneous  solutions.  The  superscript  in 
parentheses  indicates  that  vector  is  the  ( • ) ^ column  of  a matrix  denoted  by  the 
capital  letter  and  3 denotes  the  superposition  constants.  The  letter  r denotes 
the  number  of  homogeneous  equations  which  is  equal  to  the  number  of  unknown  elements 

of  y(o). 
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It  is  known  from  elementary  differential  equation  techniques  that  the  sum  of 
a particular  solution  of  a linear  differential  equation  and  a solution  of  its  homo- 
geneous differential  equation  is  merely  another  particular  solution  of  that  differ- 
ential equation. 

Utilizing  this  fact,  it  can  be  established  that  y can  be  expressed  as  a combi- 
nation of  particular  solutions  (Childs,  1971) 

y = Pa  = l p(k)  a 7,  (2-6) 

k=0  K 

where  P is  a matrix  whose  columns  are  solutions  of  equation  (2-1) , thus: 

p(k)  = Lp(k)  + f . (2-7) 

We  multiply  each  side  of  equation  (2-7)  by  ak  and  sum  these  products 


l P(k)  ak=  L l P(k)  ak+  f l ak  • (2-8) 

k=0  k=0  K k=0 

By  comparing  equation  (2-1)  with  equation  (2-8) , it  is  obvious  that  the  left  hand 
side  of  equation  (2-8)  is  the  quantity  y and  the  first  term  of  the  right  hand  side 
is  the  state  vector  y.  Therefore,  it  is  elementary  that  the  superposition  constants 
must  obey 

r 

l a,  = 1 . (2-9) 

k=0  k 

After  determining  the  superposition  constants,  subject  to  the  above  restriction, 
the  solution  becomes  trivial  and  is  generated  utilizing  the  initial  conditions 

y(o)  = P(o)  a . (2-10) 

The  reason  for  superposition  of  solutions  is  to  satisfy  the  boundary  conditions. 
It  is  necessary  that  the  superimposed  solutions  be  independent  to  be  able  to  meet 
boundary  conditions . 

Ths  requirement  of  independence  is  satisfied  using  a determinant  of  homogeneous 
solutions,  which  is  usually  known  as  the  Wronskian.  The  independence  of  homogeneous 
solutions  is  satisfied  when  the  matrix  whose  columns  are  these  vectors  of  rank  r. 

rank  (H)  = r (2-11) 

which  must  contain  at  least  one  r x r submatrix  of  H and  has  a non-zero  determinant 
for  the  range  of  values  of  the  independent  variable. 

This  can  be  applied  to  superposition  of  particular  solutions.  Define  VP  as  an 
( n+1 ) x (r+1)  matrix  in  which  the  first  row  elements  are  one  (unity)  and  remaining 
submatrix  is  P (shown  in  equation  2-6) . 
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"P  = 


LLi.i.s.1 

p 


(2-12) 


The  Wronskian  of  rank  n has  been  shown  to  obey  the  following  equation, 
Petrovski  (1966) . 


det  (H(t) ) = det  (H(o))  exp  ( j tr  (L(i))  di) 

o 


(2-13) 


where  tr  (L(®))  is  the  trace  (the  summation  of  the  principle  diagonal  of  tlie  matrix) 
°f . the  aoefficient  matrix  in  equation  (2-1).  When  the  Wronskian  is  non-zero  at  the 
initial  value  of  the  independent  variable,  then  it  is  theoretically  non-zero  for  all 
values  of  the  independent  variable  over  any  finite  interval.  The  following  theorem 
adapts  (2-13)  to  particular  solutions. 

Theorem: 


t 

det  ('P(t))  = det  ('P(o))  exp  C f tr  ( L( tj)  dij 

o 


Proof: 


(o) 


The  columns  of  “P  are  p ' and 

p(k)  m p(o)  + h(k) 


(2-14) 


(2-15) 


The  subtraction  of  one  column  of  a matrix  from  all  other  columns  does  not  affect 
the  value  of  the  determinant  of  that  matrix.  Therefore,  subtracting  the  0th  column  of 
'P  from  all  other  columns  and  comparing  with  equation  (2-13)  completes  the  proof: 


det  rp)  = det 


l _ 1_  _ _. 

o I 

„ I H 

L°  I 


. 1 


= det  (H) 


(2-16) 


The  Wronskian  shews  that  solutions  are  (not)  linearly  independent  and  that  a 
fundamental  set  of  solutions  (doesn't)  exist. 


The  relative  error  of  the  Wronskian  is  defined  as  follows: 


R(t) 


= \\W(t)\  - | Wn(t) 

\W(t)\ 


(2-17) 


where  Wn  (t)  is  evaluated  utilizing  particular  solutions  which  come  frem  numerical 
integration  procedures  and  W (t)  is  evaluated  frcm  (2-14). 

3.  AN  EXAMPLE.  The  pcatfsr  series  integration  method  was  programmed  in  FORTRAN 
utilizing  an  Amdahl  470  digital  computer.  The  program,  subroutines  and  function 
routines  used  in  the  study  were  provided  from  unpublished  studies,  Childs  (1975)  . 

The  results  are  for  damped,  forced  harmonic  oscillators  described  by  the 
following  equations: 
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h2  = ~^2  ~^2  + sin(t} 


A set  of  independent  particular  solutions  are  created  using  arbitrarily  chosen 
initial  conditions: 


Power  series  evaluations  are  then  generated  using  these  initial  conditions  and 
the  recursive  relationships.  The  set  of  particular  solutions  are  then  solved  over 
the  range  of  the  independent  variable,  i. 

By  calculating  the  Wronskian  using  both  numerical  procedures  and  the  analytical 
method,  the  relative  error  may  be  examined. 

The  results  of  the  relative  error  of  the  power  series  integration  procedure 
are  compared  to  results  obtained  from  previous  studies  by  Childs  et  al.  (1971)  con- 
cerning the  same  problan  using  two  different  numerical  integration  procedures  with 
X - 1.0  and  p = 0.2.  The  two  integration  procedures  used  to  oonpare  with  the  power 
series  method  are  modified  Euler  end  Runge-Rutta  methods.  They  are  order  and  h 4 
respectively,  where  h is  the  integration  step  size.  The  two  plots  in  Figure  4-1 
are  log- log  plots  of  R(t)  versus  h for  the  Euler  and  Runge-Kutta  procedures.  For 
these  results  it  has  been  observed  that  the  following  relationship  is  true: 

y(R(t) ) = R(yt) 

where  y is  a positive  scalar  quantity.  Fran  these  results  it  has  also  been  suggest- 
ed that  the  relative  error  is  dominated  by  the  following  proportionality  for  "reason- 
able" integration  step  sizes 

| R(t) | « h?  t 

where  j is  the  order  of  the  integration  formula  used. 

By  comparing  both  cases  (Figure  4-1. a and  4-1 .b)  it  has  been  determined  that 
they  have  slopes  of  2 and  4 respectively.  It  has  also  been  observed  that  for  "large 
step  sizes  the  points  tend  away  from  the  straight  line  due  to  approximation  error  and 
also  for  using  "small"  step  sizes  due  to  round  off  error. 

Results  for  the  power  series  integration  procedure  are  presented  in  Figure  4-2 
in  the  form  of  a log- log  plot  of  R(t)  versus  h for  different  orders  (terms  in  the 
power  series).  It  is  seen  that  a family  of  curves  exist  for  different  orders.  It 
was  observed  that  for  constant  step  sizes  the  error  decreases  as  the  number  of  terms 
increase.  As  the  step  size  increases,  the  number  of  terms  must  also  increase  in 
order  to  retain  a specified  accuracy.  Like  the  Euler  and  Runge-Kutta  procedure,  the 
relative  error  tends  toward  linearity  as  it  increases  with  step  size.  As  the  step 
size  decreases  for  each  "order  curve"  the  error  function  tends  toward  the  error 
f specification.  This  observation  can  be  explained  by  the  evaluation  subroutine  used 

\*  on  accuracy  specification  of  1 x 10“7.  Thus,  more  accuracy  was  not  attempted. 


Since  the  curves  are  all  similar,  results  will  be  explained  for  only  one  curve. 
For  the  "order  curve"  evaluated  at  4 terms,  the  step  size  begins  at  .001.  It  is 
observed  to  be  within  accuracy  specifications  due  to  the  fact  that  the  power  series 
integration  is  performed  with  such  small  step  sizes.  Since  such  small  steps  are 
used,  all  the  terms  (in  this  case,  4)  are  not  required  to  meet  the  accuracy.  As  the 
step  size  increases,  more  terms  are  required  to  meet  the  accuracy  specification.  At 
the  step  size,  .005,  it  is  observed  that  the  curve  "dips".  This  occurs  because  at 
this  step  size  mere  terms  (in  this  case,  1)  are  required  to  meet  the  accuracy.  From 
this  point  on  the  routine  is  utilizing  all  the  terms  of  the  power  series  in  order 
to  meet  the  accuracy  requirement.  However,  as  the  step  size  increases,  it  is  seen 
that  the  accuracy  is  not  being  met  due  to  the  larger  steps  being  taken.  It  is  also 
seen  that  the  error  function  is  linear  while  all  terms  of  the  power  series  are  being 
used  ard  would  continue  to  be  linear  (within  machine  limitations)  if  it  were  not 
for  round-off. 

Results  were  also  calculated  for  several  values  of  ( A , p) . All  tendencies  held 
as  shewn  in  Figure  4-2. 

4.  CONCLUSIONS . The  relative  error  of  the  Wronskian  can  apparently  be  used 
to  determine  if  the  step  size  used  by  an  integration  procedure  is  appropriate.  The 
error  would  grow  approximately  linearly  in  a log-log  plot.  Further  investigations 
should  involve  different  systems  of  equations. 
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INPUT  CONTROLLABLE  STOCHASTIC  MODEL 


Sheafen  Frank  Kuo 
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1.  INTRODUCTION.  This  paper  introduces  a model  which  incorporates 
the  principles  of  both  Markov  chains  and  finite  state  machines.  Markov 
chains  possess  stochastic  behavior  in  the  transition  between  states  but 
are  not  input  controllable.  Finite  state  machines,  on  the  other  hand, 
are  input  controllable  between  states,  but  do  not  have  stochastic 
behavior.  Basic  concepts  of  an  input  controllable  stochastic  model  and 
analysis  of  its  short-  and  long-term  behaviors  are  presented.  Forecast 
accuracy  (FA)  of  a model  is  defined  and  relations  between  strings  and 
models  are  described.  The  first  order  derivative  ( FOD)  of  a model  is 
introduced.  A sufficient  condition  for  a model  and  its  FOD  to  have 
equal  FA  is  proved.  In  addition,  some  applications  are  briefly  dis- 
cussed. 

2.  INPUT  CONTROLLABLE  STOCHASTIC  MODEL. 

A.  Definition.  An  input  Controllable  Stochastic  Model  (ICSM)  is  a 
quadruple  H = {I,  0,  S,  y}  where  I is  the  input  set,  S is  the  state  set, 
0 is  the  output  set,  and  y is  a probabilistic  function,  such  that 

y:  I x St  x 0 x St+1  - P 

where 

St,  St+1  c s 

P = the  set  of  real  numbers  between  0 and  1 . 

In  other  words,  given  input  x^  and  present  state  S^,  y assigns  a 
probability  P^mn  to  each  output  ym  and  next  state  Sn.  Using  the  prop- 
erty of  the  probability  function  gives 

E E for  all  x.el,  S.eS 

xeO  $ eS  ,Jmn  ' J 

m n 

B.  Example.  Let  I = 0 = {x,  y},  S = (A,  B,  C).  y is  defined  as 
follows: 
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y(x.  A,  x.  A)  = 1/4 
y(x,  A,  y,  B)  = 1/4 
y(x.  A,  x,  C)  = 1/2 
y(y»  A,  y,  c)  = l 
y(x,  B,  x.  A)  = 1/6 
y(x,  B,  y.  A)  = 1/6 
y(x,  B,  x,  C)  = 2/3 
y(y,  b,  x,  B)  = l 
y(x,  C,  y.  A)  = 1/2 
y(x,  C,  y,  B)  = 1/2 
y(y»  c»  x,  c)  - l 
y * 0 otherwise 

C.  Graphic  Notation.  Noting  the  input,  output,  and  probability 
on  an  arc  path  between  two  states  S.  and  S.  gives  the  following: 

J 


x/y,  1/2 


This  notation  means  that  given  input  x and  current  state  S. , the 

probability  of  getting  the  next  state  S.  and  output  y is  1/2.  Making  an 

arc  between  each  communicable  state  would  give  a flow  graph  for  that 
model.  The  flow  graph  of  the  last  example  is  shown  as  follows: 


x/y,  1/6 
x/x,  1/6 


y/x,  1 
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3.  INPUTTABLE  MARKOV  CHAIN  (IMC).  A special  case  of  ICSM  of  inter- 
est in  this  paper  is  the  model  with  empty  output  set  0.  This  kind  of 
model  is  called  the  Inputtable  Markov  Chain  (IMC). 

A.  Definition.  An  IMC  is  a triple  G = {I,  S,  k}  where  I is  the 
Input  set,  S is  the  state  set,  and  k is  a probability  function  which 
satisfies: 

k(S^,  x j , Sk)  = prob  (St+1  = | St  = is  input}  for  all 


Hence 


Sj,  S^eS  and  XjeI 


E k(S.j,  Xj,  Sk)  = 1 for  all  S*eS  and  x^el 


B.  Transition  Matrix.  Let  K = {I,  S,  k}  be  an  IMC,  where  I = 
{x^ , X2  • • • xn)  and  S = {S-j,  $2*  « • . S^}.  Using  P.jjk  = k(S^,  Xj,  S^) 

gives: 

1 2 • ‘ * m 

1 / Pljl  Plj2  * ' * Pljm  \ 

2 I P2jl  P2j2  * • * P2jm  \ 


m ' Pmjl  Pmj2  Pmjm  / 

as  a conditional  transition  matrix  of  input  Xj.  Notice  that  the  summa- 
tion of  each  row  is  1. 

Suppose  at  each  state  the  probability  of  getting  input  x^  is  q^. 


be  a diagonal  matrix 


Q 


Let 


r. 


1 * < 
f '' 

! -v 


n 

i > 

V 


PJ  ■ • P*J 

(2) 

n 

P = Z P. 

j=l J 

(3) 

Thus,  P is  a transition  matrix  without  knowing  an  input  variable. 


4.  SHORT-  AND  LONG-TERM  BEHAVIOR. 

A.  Short-Term  Behavior.  From  a given  model  one  can  explore  the 
k-step  state  distribution;  i.e.,  after  the  k-step,  the  model  will  go  to 
a certain  state  with  a certain  probability.  Two  cases  can  be  con- 
sidered: 

(1)  Input  string  is  given.  If  x-^  ...  x^  is  the  input  string, 
then  the  k-step  conditional  transition  matrix  is 

PxjXg  . . . x^  ■ PXj  • Pxg  . . . Pxk  (4) 

where  Px.  is  the  conditional  transition  matrix  defined  by  equation  1. 

J 

(2)  Input  string  is  not  given,  but  the  input  distribution  matrix 

Q.  is  given.  Equations  2 and  3 can  then  be  used  to  find  P,  and  the 
J 

k-step  transition  matrix  is  as  shown  in  equation  5. 

pk  = P • P . . . P (5) 

v V ' 

k times 

B.  Long-Term  Behavior.  For  long-term  analysis,  only  the  case 

without  input  is  considered  here.  If  k is  large,  calculating  P is 
somewhat  cumbersome,  but  applying  the  z- transformation,  which  is  a com- 
mon way  of  calculating  the  power  of  a stochastic  matrix,  simplifies  it. 
Let 

q(k)  = Pk 

The  z- transformation  Q(z)  of  q(k)  is  defined  as: 

Q(z)  = Z q(k)z"k  (6) 

k=0 
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or 

Q(z)[I  - z_1P]  = q (0 ) 

(7) 

since 

II 

O 

c r 

Hence 

Q(z)  = 0 - z '1P]_1 

(8) 

Let  the  inverse 

transform  of  Q(z)  = y (k ) . 

Therefore 

y(k)  = 3_1[(i  - z'V1] 

(9) 

q(k)  = y(k) 

00) 

or 

y(k)  = pk 

on 

From  equation  11 

9 

lim  pk  = lim  y(k) 
k -►  °°  k 

5.  STATE  PROBABILITY  AND  FORECAST  ACCURACY  OF  A MODEL. 

A.  Definition.  State  probability  (or  state  frequency)  P..  of  state 


is  defined  as 


where 


P1  = J PJPji 

Pji  = l qjkPjki 


02) 

03) 


q^k  = the  probability  of  input  at  state 

Pjki  = the  C0ndit10na1  probability  of  S,.  transferring  to 
S^,  given  input 


Obviously 


Z P4  = 1 
i 1 


04) 


Using  equations  12  through  14,  P^  for  each  i can  be  found. 

The  forecast  accuracy  [FA(R)]  of  a model  R is  defined  as: 

FA(R)  - I E CP,qlk  • rax  P,y] 
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Intuitively,  FA(R)  is  the  maximum  average  probability  of  forecasting 
the  next  state  correctly,  given  the  current  input  and  state. 

B.  Example  1. 


a,  1 


b,  1/2 


The  figure  shown  above  is  a simple  model  R with  two  states  and  two 
input  variables.  Assume  input  a and  b are  equally  probably  at  each 
state.  Simple  calculation  usings  equations  10,  11,  and  12  gives: 

2 1 

state  frequency:  PA  = y PB  = I 


Therefore,  state  A is  visited  twice  as  frequently  as  state  B is  visited. 
FA(R)  = PA  • qAa  • max  (pAaA,  pAaB>  + PA  * <lAb  * max  {PAbA*  PAbB} 

+ PB  ‘ qBa  ’ max  {PBaA’  PBaB}  + PB  ’ qBb  ' max  ^PBbA’  PBbB} 
Since  qAa  = qAb  qBa  = qBb  = \ PA  = f PB  = T 


Hence  FA(R)  = • max  (1,  0}  + y • max  {p  ]■}  + j • max  (1,  0} 


+ | • max  {1,  0}] 


Therefore 


FA(R)  = 


The  average  chance  of  forecasting  the  next  state  correctly  is  11/12, 
given  the  current  state  and  input. 

It  is  trivial  to  see  that  a deterministic  model,  like  a finite 
state  machine,  has  a forecast  accuracy  of  1. 


The  following  are  some  of  the  trivial  properties  of  FA(R): 
(1)  FA(R)^max  E E 
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if  number  of  state  is  2 


r 


m 


* 


U 

f 


v 


(2)  FA(R)  =1-11  Ptqik  min  P.,^ 

(3)  Define  FA(Rlk)  = £ P.  max  P... 

i j 1KJ 

Then  min  FA(R|k)  < FA(R)  < max  FA(Rjk) 
k k 


6.  THE  STRING  AND  MODEL.  Consider  the  following  string: 

aAaAaAbAbBbBaAbAbBaAbBaAbB  ...  (15) 

where  A and  B are  state  variables  and  a and  b are  input  variables. 

After  sufficient  observation,  a model  like  that  shown  below  can  be 
developed. 


b,  1/2 


Combining  the  last  state  with  the  current  state,  or  putting  the  current 
state  to  the  left  upper  corner  of  the  next  state  gives 

AAAABBAABABAB 
aAaAaAbAbBbBaAbAbBaAbBaAbB  . . . 

Putting  the  upper  characters  down  gives 

A A A 

aAaAAaAAbAAbABbBBaBAbAAbABaBAbABaBAaABbB  ...  (16) 

String  16  is  said  to  be  a First  Order  Derivative  (FOD)  of  string  15. 
FOD's  are  developed  to  increase  the  number  of  states  so  that  the  system 
is  better  described.  For  example,  if  string  15  is  an  observer's  weather 
record  with  A and  B meaning  sunny  and  rainy,  respectively,  and  a and  b 
meaning  decreasing  temperature  and  increasing  humidity,  respectively. 

An  FOD  of  String  15  can  be  derived  to  String  16  with  AA  as  sunny,  AB  as 
cloudy,  BA  as  partly  cloudy,  and  BB  as  rainy.  Therefore,  String  16  is 
more  descriptive  than  String  15. 
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7.  THE  DERIVATIVE  OF  THE  MODEL. 


A.  Definition.  Like  a string,  the  model  also  has  derivatives. 
Let  R be  an  IMC. 

R — {I»  S,  H},  I = { x , x2>  • . } , 


S = {Sr  S2  . . . sm},  H = {Px  j = 1,  . . . n) 


Similar  to  Example  1,  assume  that  input  a and  b are  equally  probable 
at  each  state.  Then  it  can  be  shown  that 

FA(R)  = FA(R')  (17) 

In  general,  equation  17  is  not  true.  However,  the  sufficient  con- 
dition of  it  can  be  found. 


Define  index  sets  of  R,  R"  as  follows: 

T = (r  | SreS} 

T'  = (r  | S^eS'} 

T|  = {r  | S'eS  x S.}  where  S x S.  = (SjS.  I jeT} 


LEMMA  1: 


(1)  For  all  i eTT  jeT  there  exists  j eT' 

oi  o 

(2)  For  all  i eTC  j eT'  there  exists  jeT 

0 10 

Proof:  Trivial 

LEMMA  2: 


= '^0 


S’t*  P1M^1>0 


For  all  iQeTr 


max 

jeT 


max 

jeT" 


Proof: 


Let 

pikj = pikj,  wherejieT 

08) 

By  Lemma  1 

P 

in  = ki  < inax  Pj  \j ■;  for  all  i eT. 

ikJi  ^0kJ2-joeT  Tokjo  0 1 

09) 

Conversely, 

,et  ToW  ‘ Vl 

(201 

By  Lemma  1 

pw  i pfkj3  i ™ p1kj 

Therefore 

max  P..  . = max  PI  for  all  i eTT 

jeT  1KJ  jeT'  VJ  01 

QED 
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THEOREM: 


If  q..  = qC  . for  all  i eT?  and  if  E P'  = P,  for  all  i 
'k  ’ok  0 1 10£TJ  ’o  1 

FA(R)  = FA(R") 


Proof: 


Sinre 


FA(R)  = E E P.q ..  max  P 

i 1 1 -i  if 


FA(R')  = E E P:  q:  . max  P:  . . 

i0k  V.k  J0  VJo 


=EE  E p:  q:  . max  P:  . . 
i k i0eTr  7o  V jQ  VJo 


EE  E PI  q.,  max  P...  (by  Lemma  2) 
i k iQeTr  7o  lk  j 1,CJ 

\ I [(<>ik  7 Vd 


Hence 


FA(R)  = FA(R') 


It  is  not  surprising  that  for  most  models  R and  its  FOD  R',  con- 
ditions 


s v 


for  all  i0eT7 


and  E PI  = P.  for  al 1 i 
i eT;  7o  1 


are  easily  satisfied;  thus,  the  forecast  accuracy  of  the  FOD  R^  is  not 
lost. 


8.  APPLICATION.  The  Markov  Chain  has  been  applied  to  many  manage- 
ment science  or  systems  analysis  fields.  IMC  improves  the  Markov  Chain 
because  it  has  more  features  to  adapt  the  real  work  of  physical,  economic, 
biological,  or  engineering  systems  [1],  [4],  [6].  The  most  important 
feature,  the  input  controllability,  allows  one  to  understand  a system 
by  controlling  input  to  find  the  subsequent  changes  in  state  (and  out- 
put). Because  the  processing  is  stochastic,  the  finite  state  machine 
(or  automata)  cannot  describe  the  procedure  properly.  If  a model  can  be 


' J 

build  corresponding  to  a string  of  data,  the  model  can  then  be  tested 
and  evaluated  by  calculating  its  forecast  accuracy.  The  FOD  is  a useful 
tool  in  understanding  the  model,  as  illustrated  by  the  weather  forecasting 
example. 

Some  stochastic  automata  have  already  been  applied  to  the  reliability 
problem  and  decision  process  [8].  It  is  hoped  that  the  discussion  in  this 
paper  will  create  a new  interest  in  the  research  in  a discrete  system. 
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ABSTRACT.  Selected  rock  samples  were  tested  to  failure  in  bending 
tension  and  compression  test  modes  within  the  vacuum  stage  of  a scan- 
ning electron  microscope  (SEM).  The  load  was  applied  slowly  such  that 
crack  initiation  and  growth  could  be  observed  and  recorded  by  photo- 
graphy and  video  tape.  The  failure  surfaces  were  further  evaluated  by 
standard  methods  to  determine  failure  mechanisms  involved  for  each  test 
mode  and  rock  type. 

1.  INTRODUCTION.  An  understanding  of  the  physical  properties 
and  behavior  of  rock  materials  (rock  engineering)  is  necessary  to  im- 
plement a systems  approach  for  designing  a structure.  Structural 
design  considerations  may  include  rock  removal,  tunneling,  use  of 
rock  as  a foundation  material,  or  any  combination  of  these  factors. 
Information  about  the  fundamental  mechanisms  of  the  fatigue  and  fail- 
ure properties  of  rock  is  essential  and  should  be  available  to  the 
design  engineer.  Since  construction  of  underground  structures  such 
as  tunnels  for  defense  facilities,  underground  power  plants,  and  hy- 
draulic structures  has  increased,  and  since  idealized  construction 
sites  are  not  always  available,  it  is  essential  that  rock  failure 
mechanisms  be  controlled  by  proper  design  practice. 

There  have  been  few  investigations  concerning  the  failure  modes  of 
rock  materials  in  simulated  field  tests,  primarily  because  of  the  exper- 
imental problems  associated  with  controlling  rock  failure.  Wawersik, 

Brace,  and  Fairhurst  (AROD  Proposal  11278  EN)  have  investigated  the 
post-failure  behavior  of  selected  materials.  Brace  and  Sprunt  have  in- 
vestigated the  microcavities  in  crystalline  rocks;  and  Brace  (ARO  Contract: 
DAHCO  4-73-C-0017)  is  presently  investigating  the  microstructure  in 
crystalline  rocks  with  a scanning  electron  microscope.  The  study  herein 
complements  these  and  other  investigations  by  advancing  the  state-of- 
the-art  of  failure  mechanisms. 

2.  EQUIPMENT.  The  AMR  900  Scanning  Electron  Microscope  (SEM)  is 
a hiah-resolution  instrument  providing  surface  resolution  of  100  to 
200  a and  useful  magnification  of  up  to  50,000X.  The  depth  of  focus  is 
accurate  to  tens  of  microns.  This  means  that  a fairly  rough  surface, 
such  as  a rock  fracture  surface,  will  remain  in  focus  at  high  magni- 
fications. The  micrograph  obtained  appears  similar  to  that  obtained 
from  the  reflection  light  microscope,  but  it  has  much  better  resolution 
and  depth  of  field. 
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The  AMR  1300  stage  and  chamber  door  assembly  is  inserted  in  the  AMR 
900  Chamber  to  replace  the  standard  door  and  stage.  Depending  on  the 
type  of  test  to  be  performed,  the  bending  or  tension-compression  device 
is  mounted  on  this  assembly  during  its  operation.  A platform  mounted  on 
the  base  plate  provides  the  X motion  (right  and  left),  the  Y motion  (back- 
ward and  forward)  and  the  Z motion  (up  and  down).  The  Z motion  is  actuated 
by  articulated  shafts  to  the  door  and  a single  knob  on  the  front  face  of 
the  door.  One  revolution  of  the  exterior  knob  represents  a change  in  Z 
of  1 mm;  the  readout  is  such  that  one  digit  corresponds  to  a change  in 
specimen  neight  of  0.1  mm.  A counter-clockwise  rotation  of  the  stage 
raises  the  bending  stage  and  closes  the  compression-tension  heads.  The 
same  revolution  and  motion  changes  apply  to  the  X and  Y directions. 

The  bending  stage  (Figure  1)  is  custom-designed  to  load  a rectangu- 
lar specimen  having  maximum  dimensions  of  1 x 1 x 6 in.  in  simple  three- 
point  bending  to  a maximum  load  of  2000  lb.  This  stage  is  essentially  a 
platform  having  a knife  edge  on  its  top  surface  that  supports  the  speci- 
men at  the  center  of  its  bottom  surface.  A load  bar  connected  to  the 
platform  by  a ball  screw  and  gear  system  is  connected  to  the  edge  which 
bears  down  on  the  top  of  the  specimen.  The  points  of  the  specimen's  ten- 
sion links  continually  vary  to  acconmodate  specimens  of  3 to  5 in.  in 
length.  The  maximum  bar  deflection  is  0.375  in.,  it  is  applied  via  the 
hand  crank  on  the  outside  of  the  chamber  door.  Each  digit  of  the  read- 
out corresponds  to  a specimen  deflection  of  0.004  mm  at  no  load. 

The  bending  device  is  loaded  into  the  chamber  parallel  to  the  Y 
axis  at  an  angle  of  45  degrees  to  the  horizontal.  Two  positions  180 
degrees  apart  are  possible,  allowing  observation  of  the  tension  face  or 
a side  face  of  the  specimen. 

The  tension- compression  stage  (Figure  2)  consists  of  two  heads 
mounted  on  a pair  of  right-  and  "left-hand  ball  screws.  When  the  screws 
are  rotated,  the  heads  move  either  together  or  apart,  but  remain  par- 
allel. Compression  specimens  are  placed  between  the  flat  surfaces  of 
the  heads  for  testing.  Tension  specimens  may  be  held  in  place  by  var- 
ious techniques.  In  this  study,  square  steel  heads  with  a centered 
slot  and  a pin  hole  normal  to  the  slot  were  epoxied  to  the  specimen  ends. 
These  in  turn  were  connected  by  the  pin  to  threaded  rods,  flattened  at 
one  end,  which  fed  through  the  holes  in  the  stage  heads  (Figure  3). 

The  gear  train  used  for  specimen  deflection  is  the  same  used  for  the 
! . - bending  state;  however,  one  digit  of  readout  corresponds  to  0.005  mm 

^ change  in  distance  between  the  heads.  Minimum  distance  between  the  heads 

is  0.25  in.,  and  maximum  distance  is  4.0  in.  The  maximum  load  which  may 
■ be  applied  to  either  failure  mode  (tension-compression)  is  2000  lb.  The 

stage  itself  is  tilted  at  an  angle  of  15  degrees  to  the  horizontal;  how- 
» ever,  since  a specimen  may  be  placed  in  any  orientation  between  the  heads, 

any  desired  tilt  may  be  obtained. 
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3.  MONITORING  DEVICES.  A secondary  (backscatter)  electron  image  for 
direct  observation  of  the  specimen  is  developed  and  displayed  on  a signal 
modulation  unit.  (This  is  the  primary  visual  means  of  specimen  observa- 
tion. ) 

In  addition,  the  secondary  image  nay  be  displayed  on  a 9-in.  square 
TV  rate  monitor  display  unit.  This  unit  displays  the  same  field  as  the 
previous  module,  but  has  a limited  magnification  range  of  from  100X  to 
10,000X.  It  has  a built-in  zoom  capability  that  allows  closeup  display  of 
a small  area  in  the  center  of  the  TV  and  is  operable  at  all  magnifications. 

Photomicrographs  are  obtained  through  a record  oscilloscope  4x5  in. 
square  and  Polaroid  52-P/55-P/N,  4 x 5 in.  film.  An  alphanumeric  generator 
is  integrated  into  the  signal  modular  display  unit  in  order  to  facilitate 
identification  and  description  of  the  photomicrographs. 

4.  SPECIMEN  PREPARATION.  Table  1 lists  the  representative  suite  of 

rock  samples  chosen  for  evaluation  in  this  study  and  summarizes  their 
physical  characteristics.  One  set  of  specimens  was  prepared  for  each  of 
three  test  modes:  bending,  tension,  and  compression.  In  addition,  three 

cross-sectional  dimensions  were  prepared  to  determine  any  specimen  size 
effects. 

Bending  (flexure)  specimens  were  sawed  into  beams  and  ground  square 
in  lengths  from  4 to  5 in.  long  and  cross  sections  of  1/8,  1/4,  and  1/2  in. 
square.  A fine  notch  was  filed  into  the  top  (tension)  surface  to  control 
crack  origin  during  scanning  at  high  magnifications.  This  notch  was  approx- 
imately 1/16  in.  deep  for  all  specimens. 

Tensile  specimens  were  prepared  in  the  same  manner,  but  were  cut  2-1/P 
in.  in  length.  Notches  were  ground  into  opposite  sides  of  the  specimens  to 
minimize  extraneous  stress  concentrations  at  other  points  in  the  specimen 
test  mode.  The  intact  cross-section  varied  from  3/16  to  1/4  in.,  dependina 
on  the  specimen  size.  Only  the  tensile  specimens  were  modified  for 
testing;  a steel  head  was  epoxied  to  each  end  to  facilitate  application 
of  pure  tensile  stresses. 

Compression  specimens  were  prepared  similarly  to  the  tensile  speci- 
mens in  lengths  of  2 to  3 in.,  with  no  notching  or  other  preparations 
made  after  grinding. 

a.  specimens  were  strain-gaged,  coated  under  vacuum  with  gold- 
platinum  to  facilitate  conductivity,  and  wrapped  in  aluminum  foil.  The 
purpose  of  the  foil  wrapping  was  to  prevent  spalling  durina  testing  or 
at  a failure  which  could  harm  internal  portions  of  the  vacuum  system. 


5.  SPECIMEN  FAILURE  EVALUATION.  The  selected  rock  specimens  were 

evaluated  for  bending,  tension  and  compression  failure.  Prior  to  the 
SEM  evaluation,  representative  strain-gaged  specimens  were  tested  to 
failure  in  each  mode  outside  the  vacuum  drawer  to  determine:  (1)  if 

failure  at  each  size  in  each  mode  was  feasible;  (2)  the  extent  of  spalling, 
if  any;  and  (3)  where  and  how  failure  would  occur  on  the  specimen. 

Problems  encountered  were  associated  with  the  compressive  failure  mode, 
which  proved  to  spall  excessively  and  to  fail  unpredictably  along  the  en- 
tire length  of  the  specimen.  For  the  Westerly  Blue  granite  and  trap- 
rock  shale  specimens,  the  test  size  had  to  be  reduced  to  1/4  in.  square 
(in  compression)  to  facilitate  the  2000  lb  maximum  load. 

6.  SEM  FAILURE  METHODS.  The  bending  (flexure)  failure  mode  was  first 
evaluated  in  the  vacuum  drawer  by  applying  load  to  a specimen  up  to  a 
strain  level  approaching  failure.  At  the  point  approaching  failure,  the 
load  application  was  slowed  to  approximately  0.3  mm/min.  The  notch  area 
was  scanned  during  this  load  application.  Slow  load  application  was  con- 
tinued until  crack  initiation,  when  the  load  was  stooped  and  the  crack 
scanned.  If  the  crack  was  partial,  loading  was  applied  again  while  the 
crack  tip  was  followed  with  a scan.  Loading  was  halted  periodically  for 

a side  to  side  scan.  For  the  bending  failure  mode,  there  were  no  signifi- 
cant changes  indicated  on  either  side  of  the  failure  plane. 

For  the  tensile  failure  mode  evaluation,  a slight  seating  load  was 
applied  manually  to  the  specimen  before  placing  it  in  a vacuum,  so  that 
the  specimen  would  not  rotate  during  load  application.  Since  this  test 
mode  builds  up  stress  prior  to  failure,  most  specimens  failed  rapidly, 
even  at  a very  small  load  rate.  In  some  cases,  a scan  was  possible  be- 
fore complete  separation.  When  side  scans  were  performed  in  this  failure 
mode,  such  secondary  phenomena  as  grain  separation  were  present. 

7.  FAILURE  SURFACE  EVALUATION.  After  completion  of  the  SFM  failure 
evaluation,  one  surface  of  each  failed  specimen  was  mounted  on  studs  and 
coated  with  gold-platinum.  These  surfaces  were  then  evaluated  by  stan- 
dard SEM  evaluation  procedures  and  a standard  stub  stage.  This  evaluation, 
together  with  the  SEM  failure  evaluation,  was  the  basis  of  the  failure 
analysis. 

8.  FAILURE  ANALYSIS.  When  the  beams  failed  in  a bending  mode,  both 
intergranular  and  transgranul ar  failure  mechanisms  were  present,  usually 
in  approximately  equal  distribution;  however,  different  rock  types  exhib- 
ited each  failure  mechanism  to  different  degrees.  The  Bonne  Terre  limestone 
and  Westerly  Blue  granite  exhibit  approximately  equal  distribution  of  the 
inter-  and  transgranul ar  failure  mechanisms.  The  Traprock  shale  and 
Murphy  marble  beams  primarily  displayed  transgranular  failure  and  inter- 
granular failure.  The  Danby  marble  primarily  showed  intergranular  failure 
and  some  transgranular  failure,  while  the  Berea  sandstone  exhibited  100 
percent  intergranular  failure  mechanism. 
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The  tensile  failure  test  mode  showed  no  preferences  to  either  rock 
type  or  crystal /grain  size  relative  to  the  failure  mechanisms.  Both 
inter-  and  transgranular  failure  mechanisms  were  approximately  equally 
distributed  for  each  rock  type  evaluated.  These  figures  also  indicate 
the  variation  in  crystal/grain  size  for  the  six  rock  types  evaluated. 

The  Westerly  Blue  granite,  Murphy  marble,  and  Bonne  Terre  limestone 
display  good  crystal  cleavage  planes.  The  Berea  sandstone  exhibits 
surface  wear  on  the  individual  sand  grains. 

As  anticipated  in  the  compression  mode,  transgranular  failure  mech- 
anisms were  present  due  to  the  nature  of  the  test.  However,  the  Berea 
sandstone  and  Bonne  Terre  limestone  exhibited  an  unexpectedly  excellent 
intergranular  failure  mechanism.  The  Westerly  Blue  granite,  Murphy  marble, 
and  Traprock  shale  exhibited  predominantly  (95  percent)  transgranular 
failure;  the  Danby  marble  displayed  both  failure  mechanisms,  with  inter- 
granular failure  predominating. 

Table  2 summarizes  the  failure  mechanisms  relative  to  test  mode 
and  rock  type. 

9.  SUMMARY  AND  CONCLUSIONS.  Selected  rock  samples  were  prepared 
and  tested  to  failure  by  bending,  tension,  and  compression  within  the 
vacuum  stage  of  a scanning  electron  microscope.  Load  was  applied  very 
slowly  in  order  to  observe  crack  initiation  and  growth.  Crack  growth 
was  observed  visually  and  recorded  by  both  photography  and  video  tape. 

The  crack  surfaces  of  the  failed  specimens  were  evaluated  by  standard 
methods,  and  two  evaluation  technques  were  used  to  determine  the  failure 
mechanisms  for  each  test  mode  and  rock  type  studied. 

Conclusions.  Based  on  the  techniques  of  stub  evaluation  and  failure  in 
the  vacuum  stage,  the  following  statements  apply  only  to  those  test  modes 
and  rock  materials  studied  herein: 

a.  Cross-section  size  differences  had  no  effect  on  the  failure 
mode.  The  only  benefit  derived  from  studying  several  sizes  were  facili- 
tation of  compression  testing  of  granite  and  shale  specimens. 

b.  The  rock  types  evaluated  in  this  study  had  no  apparent  effect 
on  the  failure  mode  or  the  failure  mechanisms. 

c.  Crystal/grain  size  directly  and  significantly  influences  the 
failure  mechanisms  as  follows: 

(1)  Large  crystals/grains  - failure  was  primarily  transgranular 
for  each  test  mode. 

(2)  Small  crystals/grains  - failure  was  primarily  intergranular 
for  each  test  mode. 
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d.  Cementing  agents  have  little  or  no  effect  on  the  gross  failure 
mechanisms;  however,  failure  in  the  cementing  agent  was  exclusively 
transgranular. 

10.  RECOMMENDATIONS.  This  study  has  proved  the  feasibility  and 
usefulness  of  applying  a metallurgical  research  tool  to  geologic  mate- 
rials. The  present  study,  in  conjunction  with  studies  by  Brace  of 
specimen  preparation  techniques,  could  yield  valuable  information  in 
the  area  of  geophysics  and  earthquake  analysis.  Studies  relative  to 
slickenside  development  in  clay  shales  and  other  shear  phenomena  of  soil 
and  rock  could  be  advanced  by  this  approach. 
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ABSTRACT.  The  Phase  II  Secure  Voice  Program  (P2SVP)  will  develop, 
acquire  and  install  a high  quality,  effective,  long  haul  DOD  secure 
voice  system  that  will  serve  up  to  10,000  subscribers  in  the  1985  time 
frame  to  provide  requisite  interoperability  with  strategic  and  tactical 
systems.  It  replaces  the  Phase  I Automatic  Secure  Voice  Communications 
(AUT0SEV0C0M)  Network  and  Interim  Conferencino  for  the  National  Military 
Command  System  (NMCS). 

The  independent  army  analysis  was  a unique  effort  because  this 
was  the  first  time  the  Army  was  asked  by  the  Secretary  of  Defense  to 
evaluate  another  agency's  program. 

The  Director,  Telecommunications  and  Command  and  Control  Systems, 
Office  of  the  Secretary  of  Defense  requested  the  Army  to  prepare 
Independent  Cost  Estimates  (ICE's)  of  the  P2SVP  alternatives  developed 
by  the  DCA  in  support  of  Development  Concept  Paper  (DCP)  #153.  These 
Independent  Cost  Estimates  were  to  be  prepared  for  the  Defense  Systems 
Acquisition  Review  Council,  Office  of  the  Secretary  of  Defense,  Cost 
Analysis  Improvement  Group  (DSARC,  OSD,  CAIG).  This  analysis  provided 
input  for  the  full-scale  engineering  development  decision  point. 

4 

HQ,  DARCOM  established  a Systems  Study  Group  (SSG),  Chaired  by 
myself,  consisting  of  representatives  (multi-and  inter-discipl inary) 
from  COA,  CSA,  ACC,  ECOM,  DCA,  NSA,  DCEC,  DDR&E,  DTACCS,  and  OSD. 

This  SSG  generated  an  ICE  by  analyzing  the  Phase  II  computer  printouts 
at  the  Defense  Communications  Engineering  Center  (DCEC),  supported  by 
engineering  judgement,  mathematical  analysis,  expert  opinion  and 
historical  data.  These  estimates  were  prepared  in  accordance  with 
the  Army  Materiel  Guide  for  Organizing  and  Presenting  Cost  Studies, 
and  the  HQ,  Department  of  the  Army  Investment  and  0&S  Cost  Guides  for 
t Army  Materiel  Systems. 


1.  INTRODUCTION.  An  analysis  of  P2SV  and  alternatives  was  made 
previously  by  the  Defense  Communications  Agency  (DCA)  in  the  form  of 
an  Economic  Analysis  Estimate  (EAE).  The  ICE  described  in  this  paper 
provides  an  independent  evaluation  of  the  costs  generated  in  that  EAE. 
Such  an  evaluation  is  a normal  procedure  in  the  acquisition  of  Army 
materiel  systems.  Together  with  the  benefits  (effectiveness)  calculations 
made  in  the  EAE,  it  allows  a ranking  of  the  candidate  systems  to  be  made 
and  gives  visibility  to  the  decision  maker  of  the  trade-offs  involved. 
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2.  DESCRIPTION  OF  ALTERNATIVES.  Independent  Cost  Estimates 
were  made  on  4 alternatives:  Worldwide  Tenley,  Narrowband,  Wideband 

and  Hybrid  Systems  for  the  Phase  II  Secure  Voice  Program.  Summary 
descriptions  are  given. 


TABLE  1 - SUMMARY  DESCRIPTION  OF  ALTERNATIVES 

II  NARROWBAND 
8 KBPS 

Modified  Autovon  Worldwide 
Bell  field  COMSEC 

Red/Maroon  Interface  with  Tri-Tac 


IV  HYBRID 

16  KBPS  Overseas 
8 KBPS  Conus 
Modified  Autovon  CONUS 
TTC-39  Overseas 
Bel Ifield  COMSEC  CONUS 
Tri-Tac  COMSEC  Overseas 

The  AN/TTC-39  is  a family  of  modular  and  transportable  commun ice  cion 
switching  systems  designed  to  provide  secure  automatic  switching  for 
tactical  voice  and  message  traffic.  The  family  consists  of  hybrid  circuit 
switches  varying  in  size  from  450  to  750  terminations  by  increments  of 
150  analog  or  digital  terminations  and  message  switches  equipped  for  25 
or  50  terminations. 


I WORLDWIDE  TENLEY 
16  KBPS 

Modified  Autovon  CONUS 
TTC-39  Overseas 
Predominently  Wideband 
Tri-Tac  Type  COMSEC 

III  WIDEBAND 
16  KBPS 

Modified  Autovon  CONUS 
TTC-39  Overseas 
Bell  field  COMSEC  CONUS 
Tri-Tac  COMSEC  Overseas 


A more  detailed  description  of  the  four  alternatives  are  given 

below: 

| i 

A.  ALTERNATIVE  I.  The  Worldwide  Tenley  provides  for  16  KBPS  (Wideband) 
continuously  variable  slope  delta  modulation  (CVSD)  terminals  for  all  users. 
Clear,  secure  voice  capability  will  be  provided  from  the  same  16  KBPS 
terminal.  Leased  CONUS  autovon  switches  will  be  modified  to  emulate 
certain  AN/TTC-39  switch  features  and  the  government  owned  switches 
- '}  averseas  will  be  replaced  with  AN/TTC-39  type  switches.  Concentrations 

| of  subscribers  will  be  provided  access  via  a new  automatic  4-wire  Digital 

Access  Exchange  (DAX)  concentrator.  End-to-end  encryption  will  be  provided 
for  all  calls  within  the  network,  except  for  conferencing  and  NB/WB  con- 
versions requiring  red  interfaces.  Automatic  remote  electronic  crypto- 
graphic key  distribution  will  be  provided  with  the  Tri-Tac  Tenley  COMSEC 
concept  in  both  CONUS  and  overseas. 
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B.  ALTERNATIVE  2.  Alternative  2 provides  8 KBPS  Narrowband  voice 
processor  terminals  and  Bell  field  COMSEC  in  both  CONUS  and  overseas 
portions  of  the  DCS.  CONUS  Autovon  switches  will  be  modified  for 
digital  operation  and  Bernhardt  KDC's  will  be  used  in  CONUS  and 
overseas.  A red  interface  KDC  will  be  required  for  the  DCS  Bell  field 
COMSEC  to  interoperate  with  the  Tri-Tac  Tenley  COMSEC.  Overseas,  the 
existing  government-owned  autovon  switches  will  be  modified  for  digital 
operation.  End-to-end  voice  encryption  will  be  maintained  on  intra-DCS 
calls  since  all  users  will  have  compatible  terminals.  However,  calls 

to  Tri-Tac  will  require  8 to  32  KBPS  voice  interfaces  that  will  prohibit 
end-to-end  encryption  and  insert  voice  degradation. 

C.  ALTERNATIVE  3.  Alternative  3 provides  a Worldwide  Wideband 
(16  KBPS)  system  using  Bel  1 field  COMSEC  in  CONUS  and  Tenley  COMSEC 
overseas.  As  opposed  to  the  Tenley  alternative,  it  will  not  have  COMSEC 
functions  at  each  modified  CONUS  autovon  switch.  Instead,  up  to  3 stand- 
alone Bernhardt  KDC's  will  be  dispersed  throughout  CONUS  to  serve  the 
CONUS  DCS.  CONUS  autovon  switches  will  be  modified  to  provide  digital 
service.  The  CONUS  voice  terminal  will  be  procured  to  operate  in  the 
Bellfield  COMSEC  mode.  Overseas,  this  alternative  will  require  a special 
interface  KDC  to  allow  interoperation  of  the  CONUS  Bellfield  and  the 
overseas  Tenley  key  distribution  systems.  Voice  interoperability  with 
Tri-Tac  subscribers  and  end-to-end  encryption  will  be  available. 

D.  ALTERNATIVE  4.  Alternative  4,  the  Hybrid  alternative,  provides 
8 KBPS  Narrowband  Voice  Terminals  with  Bellfield  COMSEC  in  CONUS  and 

16  KBPS  voice  terminals  with  Tenley  COMSEC  overseas.  The  Bellfield 
COMSEC  in  CONUS  will  be  achieved  with  Bernhardt  KDC's.  The  CONUS  Secure 
voice  terminals  will  be  the  product  of  a separate  Narrowband  development. 
CONUS  autovon  switches  will  be  modified  for  digital  operation.  Overseas, 
the  program  will  be  identical  to  the  Wideband  alternative,  except  that 
an  interface  will  be  required  between  the  two  dissimilar  voice  terminals 
of  each  geographic  area.  This  will  preclude  end-to-end  encryption  of 
voice  calls  between  CONUS  and  overseas  DCS  or  CONUS  DCS  and  Tri-Tac,  and 
will  introduce  noticeable  voice  degradation  for  these  calls. 

3.  METHODOLOGY.  The  methodologies  used  in  this  analysis  included 
cost  estimating  relationships,  regression  analysis,  learning  curve, 
engineering  estimates,  analogy,  delphi,  cost  factors,  complexity  factors, 
contractor  quotes,  previous  experience,  and  subjective  judgement. 

The  methodology  employed  for  the  investment  portion  of  the  ICE 
consisted  of  the  formulation  of  the  equipment  requirements  package, 
research  of  available  cost  data,  determination  of  hardware  costs  by 
analogy  and  support  costs  from  historical  information  and  cost 
estimating  guidelines.  The  cost  data  elements  of  the  investment 
analysis  include  hardware,  military  construction,  engineering,  instal- 
lation and  testing,  material,  initial  spares,  test  equipment,  data, 
training,  packing,  packaging  and  transportation. 
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The  operation  and  support  methodology  consisted  of  cost  estimating 
relationships,  computer  models,  expert  opinion,  analogy,  contractor 
quotes,  cost  factors,  and  exponential  regression  analysis.  The  cost 
data  elements  of  the  O&S  analysis  consisted  of  personnel,  consumption, 
training,  integrated  logistics  support,  maintenance,  procurement  of 
switch  modification,  transportation,  recurring  spares,  leasing  and 
utilities. 

The  methodology  for  the  R&D  cost  estimates  were  expert  opinion. 

64  individual  R&D  tasks  were  analyzed  using  a modified  delphi  technique 
and  a computer  routine.  The  cost  data  elements  of  the  R&D  analysis 
included  engineering,  tooling  and  prototypes. 

A.  As  an  example  of  the  mathematical  techniques  used  in  estimating 
costs,  an  analysis  of  CONUS  transmission  costs  is  given.  These  AT&T 
leased  lines  will  be  used  for  digital  rather  than  the  usual  analog 
transmission;  thus  there  was  no  relevant  experience  to  obtain  data. 

Two  factors  were  involved  in  the  analysis,  the  first  of  which 
was  the  increase  in  the  number  of  digital  service  areas  expected.  This 
is  expected  to  result  in  a linear  decrease  in  total  transmission  costs 
of  2%/year  for  10  years.  The  second  factor  anticipates  a reduction 
in  costs  for  providing  digital  transmission  due  to  technological 
advances  and  increased  equipment  production.  This  decrease  is  expected 
to  start  in  1980  and  is  expressed  by  the  exponential  regression, 

DC=  1/2  (1  + e _t/3) 

Where  0*t*10  corresponds  to  the  years  1980  to  1990.  This 
expression  results  from  an  exponential  regression  analysis  using  all 
available  information  on  present  and  past  transmission  leasing  costs. 

B.  The  approach  to  estimating  Operating  and  Support  (O&S)  costs 
was  as  follows.  Operator  costs  were  calculated  by  multiplying  he 
number  of  operators  required  for  each  equipment  by  the  annual  p^y  and 
allowance  for  the  operator's  grade  level. 

Maintenance  costs  were  calculated  by  multiplying  the  cost  per 
active  maintenance  man-hour  by  the  total  annual  maintenance  hours 
per  equipment.  Total  annual  maintenance  man-hours  were  calculated  by 

AMMH  = HOP  (MTTR/MTBF) , 

where:  AMMH  = Annual  Maintenance  Man-hours. 

HOP  = Hours  of  Operation  Per  Year 
MTTR  = Mean-time-to  Repair 
MTBF  - Mean-time-between  Failure 
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Depot  overhaul  costs  for  labor  and  material  were  calculated  by 
multiplying  the  depot  overhaul  cost  by  the  overhaul  rate  to  equal  the 
depot  cost  per  unit  per  year.  The  overhaul  rate  indicated  how  often 
the  unit  was  expected  to  be  sent  back  to  depot  for  overhual . The 
depot  overhaul  cost  was  estimated  by 

DOC  = 0.809  (DOR)  (UC)  881 

where:  DOC  = Depot  Overhaul  Cost/Year 

UC  = Unit  Hardware  Cost 
DOR  = Depot  Overhaul  Rate 
Standard  Error  = +60%,  -37% 

C.  Cost  Estimating  Relationships  (CER)  were  used  to  estimate 
costs  for  various  equipments.  For  example,  the  CER  used  for  the  TTC 
automatic  switching  equipment  was 

Y0  = 27284.7  + 0.002  X8  - 0.125X  1,7  + 24.898xJ'5 
2 12  3 


where: 


Y^  = Acquisition  Cost 
X.|  = Weight 
X^  = Volume 


X^  = Number  of  Lines 

4.  UNCERTAINTY  ANALYSIS.  In  all  cases  of  projected  cost  estimates 
some  degree  of  uncertainty  will  exist  and  it  is  therefore  advisable  to 
state  projected  cost  estimates  in  terms  of  most  likely  value,  lowest 
value,  and  the  most  pessimistic  (highest)  value.  The  most  likely 
value  would  be  that  value  normally  used  in  planning,  programming  and 
budgeting. 


The  ratios  of  high  and  low  values  to  most  likely  (taken  as  1) 
are  given  in  Table  2 below  for  the  preferred  alternative  1 for  R&D 
and  0&S  costs. 


TABLE  2 - UNCERTAINTY 

ANALYSIS  (ALTERNATIVE  1) 

LOW 

MOST  LIKELY 

HIGH 

R&D 

.957 

1 

1 .024 

O&S 

.850 

1 

2.054 
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The  uncertainty  in  the  investment  costs  was  analyzed  for  the 
major  equipments.  The  uncertainties  are  given  in  terms  of  percentages 
of  the  most  likely  costs. 


TABLE  3 - INVESTMENT  UNCERTAINTY  ANALYSIS 


EQUIPMENT 
AN/TTC-39  Switch 
Ten  ley  Family 
Bellfield  Family 
Loran  C 

Dax  (Concentrator) 

DSVT 

Goldwine  Mod 
Conference  Directors 
Transmission  Equipment 

SERVICE  AND  SUPPORT 
Engineer,  Install  and  Test 
Repair  Parts 
Test  Equipment 
Data 


UNIT  COST 

- 

+ 

18% 

8%~ 

298K 

15% 

15% 

292K 

25% 

25% 

20K 

5% 

.5% 

58K 

15% 

40% 

4.3K 

10% 

10% 

10. 9K 

60% 

10% 

273K 

50% 

50% 

15% 

15% 

10% 

100% 

15% 

15% 

20% 

20% 

20% 

50% 

tion 

15% 

25% 

5.  SENSITIVITY  ANALYSIS.  Cost  sentivity  analysis  is  a technique  within 
the  context  of  both  individual  system  and  force  structure  cost  analysis. 
It  involves  the  systematic  examination  of  the  effects  of  changes  in  total 
force  structure  cost  resulting  from  variations  in  characteristics,  size, 
and  composition  of  force.  The  variables  considered  in  conducting  the 
sensitivity  analysis  were  the  number  of  subscribers,  manning  levels, 
changes  in  terminals,  logistics  cost,  CONUS,  leasing  costs,  and  planning 
horizons. 

6.  COST  BENEFIT  ANALYSIS.  By  using  standard  methods  of  measuring 
benefits  (measures  of  effectiveness) , benefit/cost  ratios  were 
calculated  for  the  4 alternatives.  The  values  are  given  below: 


TABLE  4 - COST  BENEFIT  ANALYSIS 


Alternative 

Tenley 

Narrowband 

Wideband 

Hybrid 


Benefit/Cost 

484 

305 

378 

296 


7.  SUMMARY  COSTS.  The  table  below  gives  the  summary  costs  in  both 
constant  and  inflated  FY76  dollars.  The  inflated  costs  of  over  a billion 
dollars  is  a large  but  not  untypical  program  for  our  analysis  and 
evaluation. 


TABLE  5 - ICE  P2SV  GENERAL  COST  SUMMARY 


(Constant  76  $ M) 


ALTERNATIVES 

1 

II 

III 

IV 

m 

377F 

30 

37.6 

38.8 

Investment 

179.8 

209.3 

173.1 

248.8 

O&S 

569.7 

495.2 

555.5 

570.4 

TOTAL 

"78771 

743.8 

766.2 

85875" 

(Inflated 

76  $ M) 

R&D 

44.2 

45.6 

44.2 

45.7 

Investment 

235.9 

279.3 

230.2 

329.2 

O&S 

1026.2 

882.3 

1000.9 

1017.1 

TOTAL 

1306.3 

1207.2 

1275.3 

1392.0 

8.  CONCLUSION.  In  this  presentation  I have  attempted  to  give  the 
highlights  of  the  Army's  independent  analysis  of  the  P2SVP,  as  well  as 
some  of  the  complementary  calculations  used  in  an  economic  analysis  that 
are  needed  in  the  decision  acquisition  process. 
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ABSTRACT.  The  solution  of  the  class  of  problems  governed  by 
a set  of  first  order  linear  differential  equations,  subject 
to  a set  of  linear  constraints,  and  the  minimization  of  a 
defined  quadratic  performance  index  is  presented.  The  number 
of  differential  equations  must  be  greater  than  the  number 
of  constraints,  otherwise,  there  is  a unique  solution  and 
control  is  not  possible.  The  solution  is  considered  as 
known  once  the  correct  initial  conditions  are  found;  a 
number  of  initial  value  methods  are  available  to  solve 
linear  differential  equations.  Only  discrete  controls 
are  considered  here,  depicting  the  real  world  where  contin- 
uously variable  controls  are  not  always  present.  Using  the 
above,  systems  of  the  open  loop  type  are  examined. 

The  method  consists  of  superposition  of  linearly  inde- 
pendent particular  solutions  to  get  the  optimal  solution.  The 
particular  solutions  are  generated  using  a power  series 
integration  technique  on  a perturbed  set  of  arbitrarily  chosen 
conditions.  The  superposition  constants  are  deter- 
mined so  that  the  solution  both  meets  the  constraints  and 
minimizes  the  quadratic  performance  index.  The  minimum  point 
is  found  using  a method  developed  by  Childs  and  Maron  for 
the  explicit  minimum  solution  to  a set  of  quadratic  equations 
subject  to  a set  of  linear  constraints. 
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1.  INTRODUCTIC^l.  The  solution  to  a class  of  control  problems  with  discrete 
controls  is  presented.  The  class  of  problems  examined  are  those  governed  by 
a set  of  n first  order  linear  ordinary  differential  equations,  subject  to  a 
set  of  m linear  constraints  (m<n) , wherein  a given  quadratic  performance 
index  is  to  be  minimized.  The  discrete  controls  appear  in  the  solution  as 
initial  values  of  the  differential  equations.  Those  initial  values  which  are 
unspecified  by  constraints  are  determined  optimally  by  minimizing  the  given 
quadratic  performance  index. 

The  letter  y is  used  for  an  n elanent  state  variable  vector  which  is 
assumed  to  be  a function  of  the  independent  variable  t,  time.  The  dot  ( ) 
is  used  to  denote  the  total  derivative  with  respect  to  t.  The  general  set 
of  first  order  linear  ordinary  differential  equations  is  written  as 


y = + f 


t e [o,T] 


where  L is  a n by  n coefficient  matrix  whose  elements  may  be  constants  or 
functions  of  time,  / is  an  n element  vector  of  forcing  functions,  and  [o, T] 
is  the  time  interval  of  interest.  The  solution  of  equation  (1,1)  is  subject 
to  the  linear  equality  boundary  conditions  or  constraints 


q.  (y(t.))  = b. 


i = 1,2,... , m<n 


where  q.  represents  the  boundary  condition  operator  that  specifies  a linear 
ocmbination  of  elanents  of  the  state  vector.  The  ith  boundary  value,  b^, 
at  the  specified  value  of  time,  t^.  A quadratic  performance  index,  h,  where 

T 

h = J y'Mydt 


is  to  be  minimized.  The  n by  n matrix  M is  symmetric  and  known  function  of 
time , t . The  prime  ( ) ' is  used  to  indicate  the  transpose  of  a vector  or 
matrix.  The  above  three  equations  define  the  basic  problem. 

The  solution  to  the  problem  is  uniquely  defined  once  the  state,  y,  is 
known  at  any  time  t.  The  solution  is  considered  as  known  once  the  correct 
initial  conditions,  y(o) , are  kncwn.  The  y(o)  vector  gives  the  desired  control 
parameters,  and  it  can  be  used  with  the  differential  equation  (1.1)  to  generate 
an  accurate  solution  for  y as  a function  of  t.  This  is  due  to  the  availability 
of  a variety  of  initial  value  differential  equation  problem  solvers  for  today's 
digital  computers. 

The  solution  method  is  a superposition  of  solutions,  a "shooting  method".  [6] 

The  usual  methods  of  solving  similar  controls  problems  involve  the  use 
of  Lagrange  multipliers,  Hamiltonians,  co-state  equations,  etc.  which  are 
unnecessary  in  the  method  presented  in  this  paper.  [5]  The  techniques  used 
in  the  usual  methods  require  a large  amount  of  mathematical  gymnastics  in  the 
solution  process. 


2.  A SHOOTING  METHOD.  A particular  solution  of  equation  (1.1)  is  a solution 
of  a particular  set  of  initial  conditions.  We  define  such  a solution  as 


+ f 


(2.1) 


where  the  superscript,  k,  is  an  index  which  denotes  the  kth  particular  solution. 
The  state  vector,  y , is  determined  by  the  superposition  of  particular  solutions, 
and  is  expressed  as 


y = Pa 


(2.2) 
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Rewriting  after  factoring  out  L and  / fran  the  surma tions  (since  they  are  not 
indexed  by  k)  and  substituting  equation  (2.3)  and  the  derivative  of  equation 
(2.3)  with  respect  to  t into  equation  (2.4)  gives 


; i 


4 


;) 


r 

y = L y + f l a.  (2.5) 

k=0 

Comparing  equations  (1.1)  and  (2.5)  establishes  a constraint  which  the  super- 
position constants  must  meet: 

l a.  = 1 (2.6) 

k=0 

The  traditional  superposition  of  homogeneous  solutions  on  a single  particular 
solution  does  not  have  a similar  constraint.  However,  because  we  superinpose 
particular  solutions,  we  need  to  program  only  one  set  of  equations  for  each 
problem. 

Independence  of  Solutions  and  Boundary  Value  Constraints.  The  reason  for  the 
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superposition  of  the  particular  solutions  is  to  satisfy  the  boundary  conditions 
or  constraints.  This  requires  all  r+1  subsets  of  the  r particular  solutions  to 
be  linearly  independent.  To  insure  this,  P(o)  is  created  using  the  perturba- 
tion strategy:  First , arbitrary  estimates  are  made  of  the  r unknown  values  of 
y(o)  and  this  vector  is  used  for  p^° ^ (o) . Second,  columns  1 through  r of 
are  generated  by  making  each  column  the  same  as  p^°^  (oj , except  that  each  has 
one  nonzero  perturbation  from  one  of  the  estimated  elements  of  p^°^(o).  Each 
estimated  element  is  perturbed  in  one  and  only  one  column.  This  strategy 
gives  the  desired  independence. 

The  boundary  conditions  are  of  the  form  specified  in  equation  (1.2) . 

For  control  problems , these  boundary  conditions  are  usually  initial  conditions, 
but  this  is  not  required.  As  stated  previously,  r denotes  the  number  of  elements 
of  y(o)  not  uniquely  specified  by  equation  (1.2),  and  thus,  (n-r)  elements  of 
y(o)  are  uniquely  specified.  If  m is  not  equal  to  (n-r) , then  there  are 
m-(n-r)  boundary  conditions  at  times  greater  than  zero.  Subsittution  of 
(2.2)  into  (1.2)  gives 

qi  (PftJ  a)  = b i = 1,2,..., m (2.7) 

which  can  be  rewritten  for  linear  operators  q ^ , as 

l q-  (t.))  a1  = b . i = l,2,...,m  (2.8) 

yto  % 1 « t- 

Of  these  m linear  equations,  (n-r)  specify  known  values  of  y(o)  and  m-(n-r) 
specify  constraints  on  the  unknown  values  of  y(o)  in  terms  of  the  (r+1) 
unknown  superposition  constants,  the  a^' s.  With  the  addition  of  constraint 
equation  (2.6),  there  are  m-(n-r)+l  constraints  with  (r+1)  unknowns.  Since  the 
problem  statement  declares  that  m is  less  than  n , it  is  evident  that  m-(n-r)+l 
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is  less  than  r+1 , and  thus  it  is  an  underdetexmined  system.  Therefore,  the 
a^'s  are  not  uniquely  specified,  and  we  can  choose  them  to  minimize  the  per- 
formance index  of  equation  (1.3). 

Optimizing  on  the  Basis  of  the  Quadratic  Performance  Index.  The  a vector  is 
now  included  in  the  performance  index  by  the  substitution  of  equation  (2.2) 
into  equation  (1.3)  which  gives 

T 

h = f a'  P'  m Pa  dt  (2.9) 

o 

The  (r+1)  by  (r+1)  matrix  A is  defined  by: 

T 

A = J P'  M P dt  (2.10) 

o 

It  is  possible  to  rewrite  equation  (2.9)  as 

h(a)  = a'  A a (2.11) 

The  method  that  is  used  to  solve  for  A is  to  calculate  the  solution  of  the 
initial  value  problem 

A = P'  M P A(o)  = 0 (2.12) 

The  superposition  equation  (2.3)  requires  that  (r+l)*n  first  order  linear 
ordinary  differential  equations  be  integrated  and  the  matrix  A,  which  is  symmetric 
because  the  matrix  M is  synmetric,  may  be  determined  by  integrating  an  additional 
(r+1)  * (r+2)/2  first  order  linear  ordinary  differential  equations. 

In  solving  for  the  optimum  a vector,  the  explicit  formula  developed  by 
Childs  and  Maron  (1975)  is  utilized.  This  formula  states  that  the  solution  for 
a such  that 


h(a)  = a'  A a = minimum 


(2.13) 


subject  to 


Ka  = o 


(2.14) 


is 


a =-a  - N(N'  AN) 


-1 


N'  A a 


(2.15) 


where  k is  m-(n-r)+l  by  ( r+1 ) and  of  rank  m-(n-r)+l } a is  a particular  solution 
of  equation  (2.14) , and  the  columns  of  N form  a basis  for  the  null  space  of 
K.  The  ( ) in  equation  (2.15)  denotes  a matrix  inverse.  By  using  appro- 
priate matrix  operations,  it  is  possible  to  transform  equation  (2.14)  into  the 
equivalent  system 


I 


W 


a = d 


This  can  be  used  to  define  a and  N as 

P 


a = 

r*  h 

_d_ 

and  N = 

’ -W  ' 

P 

o 

~i~ 

(2.16) 


(2.17) 


The  J's  in  equations  (2.16)  and  (2.17)  are  identity  matrices  of  appropriate 
order. 


3.  AN  EXAMPLE.  The  first  problem  chosen  is 

x + 0.2.x  + x = Uj  + Ugt  t e [0310] 

subject 

x(o)  = 0 x(10)  = 1 x(o)  - 0 


(3.1) 


(3.2) 
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and 


r 2 ‘2 

h — j (x  + x ) dt  = minimum 


In  state  variable  form,  this  can  be  restated  as 


yl  y2 

y2=  -y2 
y3  = o 

y4  = 0 


- 0.  2y 2 + y3  + y4t 


subject  to 


y2(o)  = 0 y1(1o)  = 1 


y2(o)  = o 


where 


10 

f y'  M y dt 


10  0 0 
0 10  0 
0 0 0 0 
0 0 0 0 


Using  an  accuracy  of  10  and  evaluating  power  series  to  10  terms  results  in 
the  following  solution 


y1(o)  = 0. 
y2(o)  = 0. 

y Jo)  = -0.105453 


y4(o) 


0.115041 
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The  y ^ and  y elanents  are  the  forcing  function  aonstants  or  control. 
The  solution  for  y ^ and  y over  tlie  interval  [0,10]  is  given  in  Table  1. 


4.  CPNCI1JSIONS.  A direct  method  has  been  shown  for  the  solution  of  linear 
ordinary  differential  equations  subject  to  minimization  of  a quadratic  per- 
formance index  and  multipoint  boundary  values.  The  method  avoids  the  necessity 
of  Lagrange  multipliers  and  other  similar  tools. 

The  method  can  easily  be  incorporated  into  boundary  value  codes.  Most 
problems  will  have  nonlinearities  which  can  be  handled  in  the  usual  manner 
[3],  [6]. 
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NUMERICAL  SOLUTIONS 


Time 

1 

h 

0 

0 

0 

0.5 

-0.010 

-0.035 

1.0 

-0.028 

-0.031 

1.5 

-0.035 

0.006 

2.0 

-0.018 

0.066 

2.5 

0.031 

0.130 

3.0 

0.111 

0.186 

3.5 

0.213 

0.219 

4.0 

0.325 

0.225 

4.5 

0.434 

0.205 

5.0 

0.527 

0.166 

5.5 

0.598 

0.118 

6.0 

0.646 

0.075 
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0.045 

7.0 

0.694 

0.034 

7.5 

0.713 

0.043 

8.0 

0.740 

0.068 

8.5 

0.782 

0.102 

9.0 

0.841 

0.135 

9.5 

0.916 

0.161 

10.0 
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0.173 

y3  = -0.105 
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On  Generalized  Feller  Equation 
Siegfried  H.  Lehnigk 

Physical  Sciences  Directorate,  US  Army  Missile  Command 
Redstone  Arsenal , AL 


ABSTRACT 

The  generalized  Feller  equation 

i>(z)  = Azxx  + Bzx  + Cz  - zt  = 0,  z = z(x,t),  x > 0,  t ->  0, 


with  the  coefficients 


A(x) 

B(x) 


C(x) 


will  be  considered, 
equation. 


\ + 1 

= ax  , ar  > 0,  XeR,  X * 1, 

= BxxX  + p2x,  B1  2eR, 

" P*  "1  + B2,  p = x[p  L - cv ( 1 + X)], 

The  choice  of  p makes  i(z)  = 0 a Fokker-Planck 


Solutions  of  i( z)  = 0 will  be  derived  for  given  initial  and/or 
boundary  conditions.  The  derivation  of  initial  condition  solutions 
is  based  on  a basic  solution  of  a[z)  = 0 and  its  adjoint. 


Tne  complete  paper  is  published  elsewhere. 
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A PERTURBATION  METHOD  FOR  FREE  BOUNDARY 
PROBLEMS  OF  ELLIPTIC  TYPE* 

B.  A.  Fleishman  and  Thomas  J.  Mahar'1' 
Department  of  Mathematical  Sciences 
Rensselaer  Polytechnic  Institute 
Troy,  New  York  12181 


ABSTRACT.  Nonlinear  partial  differential  equations  (PDE's) 
arise  in  many  scientific  contexts,  and  boundary  value  problems 
(BVP's)  for  such  equations  present  formidable  computational  dif- 
ficulties. Thus  analytical  techniques  for  approximating  the  sol- 
utions of  such  problems  have  practical  significance. 


A formal  perturbation  method  is  described  here  for  approxi- 
mating solutions  of  certain  BVP's  for  elliptic  PDE's  containing 
discontinuous  nonlinearities.  To  illustrate,  we  treat  in  detail 
the  BVP 


P(e) 


xx 


+ u 


yy 


+ f ( u ) = o 


in  S:  0 < x < 1,  -<*>  < y < °o 


u(0,y)  = eh(y),  ux(l,y)  = 0 for  -®  < y < ® 


where  e is  a small  parameter,  h is  periodic  and  uniformly  bounded, 
and  f is  a step-function:  f(u)  = 0 for  u < y,  f(u)  = 1 for  u >_  y 

(y  a positive  constant) . u and  3u/3n  are  to  be  continuous  across 
any  "free  boundary"  u=y.  If  0 < y < 1/4,  problem  P(0)  is  shown 
to  nave  at  least  one  non-trivial  solution  u^  = u„(x)  such  that 
u„ (x)  = y (0  < x < 1) . For  y e (0,1/4)  anapproximate  solution 
u(x,y)  of  P ( e ) involving  a free  boundary  in  S is  then  sought  in 
the  form  u(x,y)  = Uq(x)  + eu(x,y),  with  the  free  boundary  assumed 
to  be  x = x + eg  (y)  . 


Two  examples  are  considered,  h(y)  = cos  y and  h a trigonomet- 
ric polynomial,  in  which  the  linear  (variational)  equation  for  u 
may  be  solved  by  separation  of  variables. 

An  unusual  feature  of  our  procedure  is  that  this  equation  for 
u contains  a delta-function  coefficient,  because  in  the  originial 
equation  f is  a step-function  in  u. 

1.  INTRODUCTION.  Nonlinear  partial  differential  equations 
(PDE's)  arise  in  many  scientific  contexts,  and  boundary  value 


Research  supported  by  U.  S.  Army  Research  Office. 

Present  address:  Courant  Institute  of  Mathematical  Sciences, 

New  York  University,  251  Mercer  Street,  New  York,  New  York 
10012  (U.S.A.) 


problems  (BVP's)  for  such  equations  present  formidable  computa- 
tional difficulties.  Thus  analytical  techniques  for  approximating 
the  solutions  of  such  problems  have  practical  significance. 

We  illustrate  here  a perturbation  method  applicable  to  cer- 
tain BVP's  for  elliptic  PDE ' s of  the  form 

Au  + f (x, u)  = 0 (1) 

where  x = (x^,...,xn)  is  a point  in  Rn,  A denotes  the  Laplacian 

operator,  u is  a real  scalar  variable,  and  f is  a piecewise- 

continuous  function  of  x.,...,x  and  u.  When  f has  jump  discon- 

i n 

tinuities  with  respect  to  u,  among  the  interfaces  across  which  f 
changes  abruptly  there  may  be  so-called  "free  boundaries"  which 
are  not  known  a priori  but  must  be  found  along  with  the  solution 
u = u (x)  . 

Suppose  f is  a step-function  in  u and  depends  also  on  m of 
the  independent  variables,  say,  x^,...,x  , where  0 _<  m < n.  Let 

D be  a fixed  region  in  Rn  whose  bounding  surfaces  are  independent 

of  X .,••«, X . 

m+1'  n 

Now  consider  a BVP  for  (1)  on  D,  denoted  by  P(e),  in  which  a 
small  parameter  e occurs  in  the  boundary  conditions  in  such  a way 
that  the  "reduced"  problem  P(0)  does  not  involve  xm+i ' • • • ' xn • 

If  a solution  Ug  = Ug  (x^,  . . . , x^  of  P(0)  is  obtainable,  we  seek  a 

solution  of  P(e)  in  the  perturbed  form  u = Ug  + eu,  with  free 

boundaries  (if  any)  which  are  perturbations  of  free  boundaries  of 
P(0).  As  we  shall  see  in  the  specific  problem  considered  below, 
for  certain  boundary  data  it  is  easy  to  find  u and  the  perturbed 
free  boundary. 

The  unusual  mathematical  feature  of  this  procedure  is  that 
we  perturb  about  a surface  of  discontinuity,  which  introduces  a 
delta-function  into  the  (variational)  equation  satisfied  by  u. 

Our  development  is  formal;  assuming  that  the  solution  we  seek  exists 
and  that  it  can  be  closely  approximated  by  an  expression  of  the 
form  Ug  + eu,  etc.,  we  calculate  u and  the  modified  free  boundary. 

Free  boundary  problems  for  equation* similar  to  (1)  occur  in  plasma 
physics;  in  [1] , for  example,  the  authors  consider  equations  of 
the  form  Lu  + f(x,u)  = 0,  where  L is  an  elliptic  operator  and  f 
is,  however,  piecewise-linear  in  u,  not  discontinuous.  Free  bound- 
ary problems  for  equations  of  the  form  div  (K  grad  u)  =0,  where 
K = K(x,u)  is  a piecewise-continuous  function  (which  arise  in  the 
equilibrium  Stefan  problem  [2]  and  govern  certain  diffusion  and 
metallurgical  processes)  are  also  being  investigated  by  the  method 
illustrated  here. 

Besides  occurring  naturally,  problems  with  discontinuous  non- 
linearities  are  sometimes  introduced  as  approximations  (e.g.,  see 
[3])  to  problems  with  smooth  nonlinearities  (which,  in  general, 
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can  not  be  solved  explicitly) . The  authors  are  investigating  the 
feasibility  of  deriving  approximate  solutions  of  BVP 1 s for  equa- 
tions of  type  (1)  in  which  f is  bounded  and  has  smooth  dependence 
on  u,  by  first  replacing  the  smooth  function  f with  one  which  is 
a step- function  in  u,  then  employing  the  procedure  described  here 
to  treat  the  approximating  problem.  In  this  connection  it  is  im- 
portant to  note  that  if  the  perturbation  procedure  is  applied 
directly  to  an  equation  of  the  form  (1)  containing  a smooth  non- 
linearity f,  the  variational  equation  (to  be  solved  for  u)  will 
always  have  variable  coefficients. 

The  remainder  of  this  paper  (Sections  2,  3*  4 and  5)  is 
devoted  to  applying  the  perturbation  technique  to'  the  particular 
BVP  consisting  of  equations  (2)  and  (3)  below. 


2.  A PARTICULAR  FREE  BOUNDARY  PROBLEM.  Let  us  denote  by 
P(e)  the  following  two-dimensional  BVP  for  a nonlinear  PDE  in  the 
vertical  strip 


S = { (x,y) : 0 < x < 1,  -®  < y < «}; 


P(e) 


Au  + f (u)  = 0 


in  S (2 ) 


[ u (0 , y ) = eh (y)  , ux(l,y)  = 0 (-»  < y < ®)  (3) 

2 2 2 2 

Here  A = 3 /9x  + 9 /9y  , h is  a given  continuous,  bounded,  per- 

iodic function,  e _>  0 is  a (small)  constant,,  and  f is  a step- 
function  with  given  threshold  value  y >_  0 : 


f (u) 


0 

1 


u < y 
u > y 


(We  could  also  write  f(u)  = H(u  - y),  where  H is  the  Heaviside 
unit  function. ) 

Solutions  of  P ( e ) will  be  required  to  be  periodic  and 
(therefore  bounded)  in  the  closure  of  S.  In  particular,  then,  u 
and  its  normal  derivative  9u/9n  must  be  continuous  across  any  free 
boundary  (not  known  a priori),  where  u = y. 

Suppose  that  h is  bounded  by  1,  also  that  0 £ e < y.  Then 
by  continuity,  u < y at  points  of  S close  to  the  left  boundary 
x = 0.  If  u < y throughout  S,  f = 0 in  S and  P(e)  is  a (linear) 
BVP  for  Laplace's  equation.  For  solutions  satisfying  u > y some- 
where in  S,  however,  P(e)  is  not  linear.  Analysis  of  the 
"reduced"  problem  P(0)  (see  Section  3)  suggests  that  for  small 
positive  y,  P(e)  possesses  solutions  of  both  the  linear  and  non- 
linear problems. 

The  nonlinear  case  of  P(e)  is  of  interest  here.  We  seek 
an  approximate  solution  in  the  form  u = Uq  + eu,  where  Uq  is 

a (known)  solution  of  the  (one-dimensional)  nonlinear  problem 
P(0);  likewise  free  boundaries  in  P(e)  are  assumed  to  be  pertur- 
bations of  the  free  boundaries  in  P(0).  In  Section  3 we  obtain 
the  solution(s)  of  P(0)  for  all  non-negative  values  of  y;  in  par- 
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ticular,  it  is  shown  that  when  0 £ y <_  j there  are  non-trivial 
solutions. 


In  Section  4 we  perturb  the  PDE  (2)  about  Uq  and  obtain 

the  (linear)  variational  equation  for  u.  In  Section  5,  taking 
h(y)  = cos  y,  we  solve  the  BVP  for  u by  separation  of  variables. 
The  free  boundary  is  determined  by  substituting  for  x,  in  the 
interface  condition 


u(x,y)  = uQ ( x)  + eu(x,y)  = u , 

the  assumed  form  x = x + eg(y),  where  Uq  (xj  = y (that  is,  x = x 

is  the  interface  for  the  reduced  problem)  and  g is  the  periodic 
function  which  we  must  find.  Also  for  more  general  boundary  data 
(namely,  h a trigonometric  polynomial)  the  variables  can  be 
separated;  in  this  case  we  merely  sketch  the  procedure. 


3,  ANALYSIS  OF  P(0) . When  e = 0,  the  boundary  conditions 
(3)  are  both  independent  of  y;  thus  P(0)  reduces  to  the  following 
one- dimensional  problem: 


P(0) 


u"  + f(u)  =0 
u ( 0)  = 0,  u’  (1)  = 0 


in  I:  0 < x < 1 


(4) 


where  ' = d/dx.  We  shall  find  all  C solutions  for  y > 0. 


Note  first  that  all  solutions  are  non-negative,  because 
u(0)  = 0 and  u1  (x)  >_  0 on  I.  The  latter  follows  from  the  facts 
that  u"  = - f(u)  £ 0 (wherever  u"  exists)  and  u' (1)  = 0. 

When  y = 0,  (4)  takes  the  form  u"  = -1  on  I.  Then  P(0)  has 

the  unique  solution  u(x)  = x - x^/2. 

For  any  fixed  y > 0,  P(0)  has  the  trivial  solution  u(x)  = 0. 

In  order  for  a non-trivial  solution  to  exist,  it  is  necessary  that 
there  be  a smallest  value  x in  I such  that  u(x)  = y.  Then  u(x)  < y 
for  0 _<  x < x and  (since  u'  (x)  0)  u(x)  y for  x £ x _<  1 . There- 

fore a non-trivial  solution  of  P(0)  must  satisfy 


u"  = 0 

for 

0 < 

X < X 

u"  + 1 = 0 

for 

x < 

X < 1 

plus  the  boundary  and  continuity  conditions 


u (0 ) = 0 , u' (1)  = 0 

u(x+)  = u(x-)  = y,  u'  (x+)  = u'  (x-) 
for  some  x in  I. 


(5) 


(6) 


Solving  the  differential  equations  (5)  on  their  respective 
intervals,  then  subjecting  them  to  conditions  (6) , we  find  that 
for  y > 0, 
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u0(x) 


(7) 


i 


(l-x)x  (0£X£X) 

x - i(x^  + x^)  (x  £ x <_  1) 


is  a solution  of  P(0)  provided  x (0  < x < 1)  satisfies 

x(l  - x)  = u . (8) 

This  quadratic  equation  has  distinct  real  roots  x in  I when 
0 < y < 1/4,  the  double  root  x = 1/2  when  y = 1/4,  and  complex 
roots  when  y > 1/4. 

We  can  now  describe  the  numbers  and  types  of  C"*-  solutions 
of  P (0)  for  all  non-negative  values  of  y: 

2 

y = 0:  Unique  solution:  u(x)  = x - x /2 . 


0 < y < 1/4:  Three  solutions:  the  trivial  one  plus  two 

solutions  given  by  (7),  each  corresponding  to  a different 
root  of  ( 8)  . 


y = 1/4:  Two  solutions:  the  trivial  one  plus  one  given 

by  (7)  when  x = 1/2. 

y > 1/4:  Unique  solution:  u(x)  = 0. 

4.  THE  PERTURBATION  PROCEDURE.  Henceforth  our  attention  is 
restricted  to  values  of  y e (0,1/4). 

As  seen  in  Section  3,  for  each  such  y,  P(0)  possesses  two 
non- trivial  solutions  in  addition  to  the  trivial  one.  Focussing 
on  the  nonlinear  case,  we  have  reason  to  expect  (see  [4])  that 
there  exists  a solution  of  P(e)  close  to  at  least  one  of  the 
non-trivial  solutions  Uq(x)  of  P(0). 

For  fixed  y e (0,1/4),  let  uQ(x)  be  the  solution  (7)  of 

P(e)  corresponding,  say,  to  the  smaller  root  of  equation  (8);  thus, 
0 < x < 1/2.  (The  formal  calculation  which  follows  is  the  same  for 
either  root.)  We  shall  assume  that  the  y-periodic  solution  of 
P ( e)  close  to  this  un(x)  can  be  written,  neglecting  terms  which  are 

0(e2),  0 

u(x,y)  = Uq (x)  + eu(x,y)  , (9) 

where  u is  a function  periodic  in  y and  uniformly  bounded  in 
the  closed  strip. 

Similarly,  we  assume  that  the  solution  (9)  has  a free  boundary 
which  may  be  represented 

x = x + eg(y),  (10) 

that  is,  as  a perturbation  of  the  "free  boundary"  x = x in  Uq(x). 
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Subtracting  Aun  + f(un)  = 0 from  Au  + f(u)  = 
that  (formally) 


f (U)  = f ( Uq  + eu)  " f(uQ)  + f'(u0)*EU 
we  obtain  the  variational  equation 
Au  + f ' (Uq) u = 0 , 


0 and  noting 


or 


Au  + 


6 (x  - x) 

uo(x) 


u 


0 


(11) 


where  we  have  used  the  identities 

f ' (uQ (x) ) = H' [uQ (x)  - y]  = 6 [uQ (x)  - y]  = 6 (x  - x)/u^ (x)  . 

From  (3),  (9)  and  Uq(0)  = 0,  Uq  (1)  = 0 follow  the  boundary  condi- 

tions on  u: 

u(0,y)  = h (y)  , |^(l,y)  = 0 (-*  < y < 00 ) (12) 

In  the  next  section  two  examples  are  considered  in  which 
h actually  varies  with  y in  a periodic  fashion.  First  we  can 
gain  some  confidence  in  the  validity  of  the  perturbation  pro- 
cedure from  consideration  of  the  simple  example 

h (y)  = e (0  £ e < y ) . 

In  this  example  P(e)  is  itself  a one-dimensional  problem; 
we  are  still  interested  in  the  nonlinear  case.  Without  giving 
details  we  point  out  that  if  first  one  solves  P(e)  exactly  (by 
an  analysis  similar  to  that  of  P(0)  in  the  previous  section) , 
then  seeks  an  approximate  solution  in  the  form  u = Uq  + eu, 
with  interface  x^  = x + eg  (by  solving  the  BVP  (11  -12)),  one 

finds  that  the  latter  expressions  agree  with  the  exact  representa- 
tions for  u and  x^  through  terms  of  first  order  in  e. 

5.  EXAMPLES.  We  give  two  examples  in  which  the  linear  BVP 
(11  - 12)  can  be  solved  by  separation  of  variables. 

EXAMPLE  1;  In  P(e)  let 


h(y)  = cos  y 


Substitution  in  (11)  and  (12)  of 
u(x,y)  = v(x)  cos  y 
yields  the  BVP 


v"  - 
v(0) 


V + 
= 1 


6 (x  - x) 

u0(x) 

, V (1) 


V = 


= 0 


0 


(0  < x < 1) 


(13) 
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The  differential  equation  in  (13)  implies  a jump  condition 
at  x = x.  Suppose  v(x)  is_a  solution  continuous  on  [0,1].  Inte- 
grating the  equation  from  x - n to  x + n (n  small  and  positive) , 
then  letting  n -+■  0,  we  find  that  the  slope  of  v(x)  undergoes  a 
jump  at  x = 3?: 


r'  (x+)  - v(x-)  = - v(x)/A  , 


(14) 


where 


A = Uq(x)  = 1 - x 


Now  solving  v"  - v = 0 on  each  of  the  intervals  0 < x < x 
and  x < x < 1 (so  that  we  have  four  arbitrary  constants) , then 
imposing  the  boundary  conditions  from  (13) , the  jump  condition 
(14)  and  the  continuity  condition  v(x+)  = v(x-) , we  obtain  for 
BVP  (13)  the  continuous  solution 


v(x)  = 


cosh  x + A sinh  x 
B cosh  (1  - x) 


(0  _<  x _<  x) 
(x  < x < 1) 


(15) 


where 


A = B [y  cosh  x cosh  (1  - x)  - sinh  1] 


B = [cosh  1 - i sinh  x cosh  (1  - x)  ] 1 


(16) 


We  seek  the  free  boundary,  for  the  solution  u(x,y)  given  approxi- 
mately by  (9) , as  a perturbation  of  x = x,  the  free  boundary  for 
In  other  words,  it  is  assumed  that  u = y along  a curve 


u0(x) 


X = X + eg (y) 


(17) 


where  g is  a periodic  function  and  terms  of  order  z‘ 
are  neglected. 


Substitution  of  x + eg(y)  for  x in 
Uq(x)  + ev(x)  cos  y = y 


gives 


uQ  (x  + eg (y) ) + ev(x  + eg(y))cos  y = y , 


UQ  (x)  + Ug(x) 


eg(y)  + ev(x)cos  y + 0(e  ) = y, 
eAg(y)  + ev(x)cos  y = 0 , 


(18 


where  we  have  used  u~(x)  = y,  ui(x)  = A,  and  the  fact  that  while 
v is  not  differentiable  on  [0,1]  it  is  Lipschitzian.  Finally 
from  (18) 


B 


g(y)  = - (v(x)/A)cos  y = - — cosh  (1  - x)  cos  y . 


(19) 
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It  should  be  remarked  that  u(x,y)  = uQ(x)  + ev(x)cos  y, 
where  uQ  and  v are  given  by  (7)  and  (15)  respectively,  is  not 

in  S,  as  required.  It  is  only  when  we  adjust  the  (free) 
boundary  between  the  left-  and  right-hand  regions,  by  wiggling 
the  interface,  that  we  obtain  a C*  (approximate)  solution. 

To  sum  up,  for  given  y e (0,1/4)  and  0 < e < p we  have  de- 
rived, by  a formal  perturbation  scheme,  an  approximate  solution 
of  P ( e) , which  is  and  periodic  in  y,  of  the  form 


(1  - x)  x + e (cosh  x + Asinh  x)  cos  y,  0<^x<^x+g(y) 


u (x,y) = 


|x  - ^(x2  + x2)  + eB  cosh(l  - x) cos 

where  x is  the  smaller  root  of  (8),  while  A, 
by  (16)  and  (19)  respectively. 


y,  x + g(y)  < x<_l 
B and  g(y)  are  given 


It  may  be  shown,  finally,  that  the  requirement  that  9u/3n  be 
continuous  across  the  interface  is  satisfied  to  within  terms  of 
order  e2. 


EXAMPLE  2:  In  P(e)  let 

N 

h(y)  = aQ  + Z (an  cos  ny  + bn  sin  ny) 


where  N is  a positive  integer.  Because  the  treatment  is  similar 
to  that  of  the  previous  example,  we  shall  only  touch  on  the  points 
of  difference. 


_Again  we  fix  jj  e (0,1/4),  choose  the  root  of  (8)  satisfying 
0 < x < 1/2,  and  require  0 < e < u.  To  insure  |h(y)  | <_  1,  let 

N 

la0l  + ^_  ( I an  I + I bn  I } — 1 • 


Again  assuming  the  approximate  solution  of  P(e)  to  have  the 
form  (9)  and  the  free  boundary  to  have  the  form  (10) , we  are  led 
to  the  BVP  (11  - 12).  Instead  of  u(x,y)  = v(x) cos  y,  however,  we 
now  set  N 

u(x,y)  = aQv0(x)  + £ (anvn(x)cos  ny  + bnwn(x)sin  ny)  . 


Substituting  this  for  u in  (11)  and  (12) , then  separating  vari- 
ables, we  find  that  for  n = 1,...,N,  both  v and  wn  must  be  solutions 
of  the  BVP 


v (0 ) = 1 , v’ (1)  = 0 
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while  Vg  must  be  a solution  for  n = 0. 

Proceeding  as  in  the  previous  example,  one  can  obtain  the 
expressions  for  uQ(x)  + £u(x,y)  to  the  left  and  right  of  the  free 

boundary,  also  the  approximate  representation  x = x + eg(y)  for 
the  free  boundary  itself.  But  we  shall  omit  the  details. 
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ABSTRACT 

We  determine  the  propagation  constants  which  describe  mathemati- 
cally the  behavior  of  electromagnetic  waves  reflected  from  dielectric- 
coated  wires.  These  are  obtained  from  the  roots  of  two  characteristic 
equations  of  transcendental  type.  The  roots  are  the  propagation  con- 
stants of  the  creeping  waves  generated  by  diffraction  of  plane  waves 
polarized  tangentially  and  normally  to  the  wire  axis,  respectively. 
Their  real  and  imaginary  parts  give  the  phase  and  attenuation  of  the 
creeping  waves  around  the  circumference  of  the  wire. 
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Leon  Kotin 
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U.  S.  Army  Electronics  Command,  Fort  Monmouth,  New  Jersey  07703 

1.  Introduction.  The  effectiveness  of  many  communication 
systems  can  be  seriously  diminished  by  reflections  of  electromagnetic 
signals  from  obstacles,  both  natural  and  man-made.  Dielectric-coated 
wires  constitute  a man-made  obstacle  which  appears  with  increasing 
frequency  in  military  situations.  Nor  is  this  obstacle  restricted  to 
communications  effects.  The  U.  S.  A.  Board  of  Aviation  Accident  Re- 
search recently  cited  the  following  statistics  for  a four-year  period 
of  daylight  operations  under  peacetime  conditions.  There  were  156 
accidents  involving  low-flying  aircraft  and  electric  wires.  Those  re- 
sulted in  78  fatalities,  56  injuries,  and  C.6  million  dollars  damage. 

In  this  paper  we  obtain  the  propagation  constants  wl  ich  describe 
mathematically  the  behavior  of  wave s reflected  from  dielectric-coated 
wi res . 

In  an  attempt  to  determine  reasonably  rapid  convergent  series 
representations  for  the  scatter  field  and  radar  response  of  dielectric- 
coated  wires,  F.  Schwering  and  C.  De  Santis  [6]  obtained  two  compli- 
cated characteristic  equations  of  transcendental  type.  The  roots  of 
these  equations  are  the  propagation  constants  of  the  creeping  waves 
generated  by  diffraction  of  plane  waves  polarized  tangentially  and 
normally  to  the  wire  axis,  respectively.  Iheir  real  and  imaginary 
parts  give  the  phase  and  attenuation  of  the  creeping  waves  around  the 
circumference  of  the  wire. 
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Introduci ng 


j(2) , 


x = ka  , y = kda  , z = kjh  , Hv(x)  i hh  '(x) 


for  simplicity  into  (1)  - (3),  we  shall  obtain  v as  the  zeros  of  the 


function  U"  = U : 
v v 


Uv  i xlV  (x)Wy(y,z)  - yHv(x)W^(y,z) 


where 


Wv(y,z)  i Jv(y)Yv(z)  - Jv(z)Yv(y) 


(4) 

(5) 


and 


8W 

w'(y,z)  s ^ . 


Using  function-theoretical  and  analytical  techniques,  we  shall 
obtain  first  some  general  qualitative  properties  of  v and  then  analyti- 
cal approximations  to  the  large  zeros.  Finally  we  shall  give  numer- 
ically the  physically  significant  smallest  zeros  for  several  representa- 
tive values  of  x,  y and  z. 

2.  Thr  symmetry  of  the  zeros.  First  we  show  that  the  function 


i v.r/2 


U is  an  even  function  of  v. 
v 


Theorem  1.  If  U is  defined  by  (4),  then 


e-iv^2u  = eW2u 

V -V 


Proof.  We  have  [5] 


J (t)  COS  V7T  - J (t) 
Y (t)  *-* liL 


(6) 


sin  vtt 


whenever  v is  not  an  integer.  (For  integral  n,  Y (t)  = Tim  Y (t). 

n V-*I1  v 

In  this  case,  the  following  argument  can  be  modified  by  taking  limits.) 
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Then 


wv(*’z)  =inrv,|_Jv(y)(C0S  Jv(z)  • J-v(z)) 

-J(z)(cos  VTT  J (y)  - J 


sin  vir 


[-Jv(y)J-v(z)  ' Jv(z)J-v(y)] 


w (y,z)  = W (y,z) 

■ V -v 


Since  [5,  p.  67]  H = e1viTH  , we  have  from  (4) 


U_v  = e~1v1t(xhT  (x)W_^(y,z)  - yHv(x)W:v(y,z), 


whence  from  (8) 


U = e”1viTU 


This  immediately  gives  us  the  desired  result: 


e-ivTr/2LJ  = eivn/2u 

V -V 


An  obvious  consequence  is  that  the  zeros  are  symmetric  with 
respect  to  the  origin  in  the  complex  v-plane. 

Corollary.  If  v is  a zero  of  U^f  so  is  -v. 

It  is  interesting  to  note  that  this  simple  theorem  yields  results 
which  are  far  less  obvious  than  the  above  corollary.  These  results 
refer  to  the  strict  complexity  of  the  zeros  and  thn  infinitude  of  zeros, 
and  appear  in  the  following  sections. 


In  the  rest  of  this 


the  letter  coming  from  the  right-hand  side  of  (14).  The  determinant 

of  coefficients  of  xW  and  yU'  must  then  vanish: 

v v 

A h = 0 (16) 

V V V V 

This,  however,  is  impossible,  since  and  H are  linearly 

independent  solutions  of  Bessel's  equation.  Indeed,  A = -4i/irx  t 0 
[5,  p.  68].  Thus  £5^0  and  the  zeros  are  not  real. 


Applying  a similar  argument  assuming  a = 0,  whence  -v  = v,  and 
taking  the  left-hand  side  of  (14)  give  another  contradiction.  This 
shows  t.iat  the  zeros  cannot  he  pure  imaginary  either,  completing  the 
proof  of  the  theorem. 

4.  The  infinitude  of  zeros.  We  know  from  physical  considerations, 
of  course,  that  there  exist  roots  of  the  characteristic  equation.  We 
now  prove  that  there  are  an  infinite  number  of  these  roots.  To  this 
end,  we  invoke  some  function- theoreti cal  considerations,  such  os  the 
concept  of  order  of  growth  w(f)  of  an  entire  (or  integral)  function 
f(v)  [1,  p.  8;  7,  p.  248],  defined  as  the  infimum  of  all  exponents  p 
such  that 


|f(v)  | = 0(e^ v'  ) as  | v | -*•  °° 

Using  Poisson's  formula  [5,  p.  79]: 


Jv(y>  ■-  ASL 

v /?r(v  + 


71/2  2v 

f cos(y  cos  t)  sin  t dt  , 


we  find  easily  that  when  a > 0, 


,<y)  s 


1 \2 / i / sin  tdt 

/wj  r + h)  | o 


TTTT^W 


< Vir e 

= I r ( v + h)  ~ 


l 

v ln2 


Thus  the  order  of  the  integral  is  at  most  1 when  a s Re  v = 0. 

Moreover  the  entire  function  l/r(v)  is  of  order  1 [7,  p.  255].  Since 
the  order  of  the  product  (or  sum)  is  no  greater  than  that  of  the 
greater  factor  (or  term),  it  follows  that  the  order  of  J^(y)  is  no 
greater  than  unity  when  Re  v = 0. 

To  eliminate  this  restriction  on  the  sign  of  a = Re  v,  we  use  the 
facts  that 


h'"h 


1 [4,  p.  229]  , 


= J + iY 

) V V 


J = J cos  tv  - Y sin  tv 

-v  v v 


from  (19)  and  (20),  we  find  that  to(Y^)  = 1 for  a = 0.  Then  we  con- 
clude from  (21)  and  earlier  results  that  w(J  ) = 1,  with  no  restriction 
on  <*.  Moreover,  since  J , Yv,  Hv  and  their  derivatives  can  be  expressed 
[5,  S 3.1]  in  terms  of  e1x1T  and  the  Bessel  function  J with  indices 
tv,  ± v +1  and  ±v-l  , it  follows  finally  that 

Lemma.  The  order  of  growth  of  Uy  is  less  than  or  equal  to  1. 

2.  j tt/o 

Now  let  = X.  Then  since  e~  v1T'nJ  is  an  entire  even  function 

v 

of  v of  order  = 1,  the  function  f(x)  = e”1v1^U  is  an  entire  function 

v 

of  x whose  order  is  - ■*-.  Consequently  [7,  pp.  250,  252],  f(x)  has  an 

infinite  numbe ' of  2C''ts  X^.  From  the?  definition  of  f(0,  we  conclude 

Theorem  3.  U has  an  infinite  number  of  zeros, 
v 


Moreover  [7,  p.  250]  we  obtain  the  following  product  repre- 
sentation: 

fU)  = f(0)  n (l  - x) 

k=l\  Aky 

Expressed  in  terms  of  U , (22)  becomes 


U = e1v1t  U 

V 


«w( 


1 -4 


where  the  are  the  zeros  of  Uy.  Note  that  from  Theorem  2, 

UQ  f 0,  as  is  required  for  this  product  representation  to  be  valid. 

5.  The  large  zeros  of  U . Since  there  are  an  infinite  number 

y_ 

of  zeros  of  the  entire  function  U , the  zeros  are  arbitrarily 

v 

large.  To  approximate  these  when  |v|  >>  rnax(x,y,z) , we  first 
express  U in  terms  of  J alone  using  standard  identities  [5,  § 3.1], 

V 

obtaining 


. . 2 
2i  sin  vir 


Uv  = X[elV1I(Jv-l(x)  - JV  + l(x))  + J-V  + l(x)  - J-V  - l(x)j 

X + Jv(z)J-v(y)]  + y [elV1Tjv(X)  - J-VW] 


_J-V{Z)( 


J Ay)  - J 

v - I V 


♦ ,<y>)  + V*)(j.v  * ,(y)  - J.v . ,(y))_ 
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Then  using  the  asymptotic  behavior  of  J (t): 


Jp(t) 


1 + 0 


viu  + r 


for  large  y,  and  dropping  the  lower-order  terms,  we  obtain  from  (24) 


v£n(+  2yv/exz)  ~ (n  + $s)iri 


as  + 6 > o , 


implicitly  giving  approximately  the  n-th  zero  for  large  n. 

Since  the  zeros  are  symmetric  in  the  v-plane,  we  can  select 
B > 0 and  thus  drop  the  lower  signs  in  (26).  Rewriting  (26)  as 
v ~ (n  - ^)iri  / ln(- 2yv/exz)  , 

iterating,  and  neglecting  the  lower-order  terms,  we  obtain  the 
following  explicit  approximation  to  the  large  zeros. 

Theorem  4.  The  large  zeros  of  in  the  upper  half-plane 
are  given  by 


f— 

_ -(n  - + (n  - %)iri  In  ((2n  - J^try/exz) 

n 

[ln[  (2n  - ^)iry/exz)]2 

for  sufficiently  large  integers  n. 

As  a consequence, 


('  * (28> 


n 2 


as  n + » 


since  the  real  part  approaches  infinity  more  slovvly  than  the 
imaginary  part.  Furthermore,  it  can  easily  be  shown  from  (28) 
that  the  distance  between  consecutive  zeros  approaches  zero. 

We  remark  that  this  behavior,  indeed  the  asymptotic  represen- 
tation (28),  is  very  similar  to  that  of  H^(x),  which  arises  in 


the  theory  of  diffraction  of  electromagnetic  waves  by  a perfectly 
conducting  sphere  (cf.  [2],  [3],  [4]). 


f 


6.  The  case  of  normal  polarization.  If  the  incident  wave 
is  polarized  normally  to  the  axis  of  the  dielectric-coated  wire, 
the  characteristic  equation  is 


i yH;(x)[jv(,)y;(z)  - rU)Yu(y)]  - xHv(x)[j;(y)Y;(z)  - J;(z)v;(y)] 


Since  the  treatment  of  this  case  is  identical  to  the  previously 
discussed  case  of  parallel  polarization,  it  suffices  merely  to 
state  the  corresponding  results. 

Theorems.  e~Uv/: V~  = eivir/2U  ^ 

V -V 

x 

Corollary.  If  v is  a zero  of  U , so  is  -v. 

v 

x 

Theorem  6.  If  U = 0,  then  Re  vim  v f 0. 

V 

X 

Theorem  7.  U has  an  infinite  number  of  zeros. 

v 

7.  The  smallest  zeros.  Following  are  a table  and  curves  (Fig.  ?) 
of  the  smallest  zeros  of  and  UA  in  the  second  quadrant  of  the 

V V 

complex  v-plane  for  each  of  several  representative  values  of  the 
parameters  x,  y,  z.  These  values  are  x = 0.5(0. 5)5,  with 
y = 1.5  x and  z = 0.9  y.  We  recall  that  x = ka,  y = k.a,  and 


w 

z - k^b  where  k is  the  free-space  wave  number,  k^  the  wave  number 
of  the  dielectric  coat,  and  a and  b the  radii  of  the  coat.  The 
coefficient  1.5  = kd/k  is  the  refractive  index  of  polyethylene 
and  the  coefficient  0.9  = b/a  is  the  ratio  of  the  two  radii. 


X 

U 

“ 1 

11 

= 0 
V 

J. 

U « 

V 

= 0 

Re  v 

Im  v 

| 

Re  v 

Im  v 

0.5 

-1.1075 

1.3605 

-0.7690 

0.6697 

1.0 

-1.7462 

1.6556 

-1.3273 

0.7794 

1.5 

-2.3282 

1.8668 

-1.8707 

0.6427 

2.0 

-2.8019 

2.0365 

-2.4089 

0.8815 

2.5 

-3.4176 

2.1804 

-2.9475 

0.9033 

3.0 

-3.9399 

2.3063 

-3.4831 

0.9121 

3.5 

-4.4516 

2.4184 

-4.0226 

0.9101 

4.0 

-4.9538 

! 2.5198 

-4.5653 

0.8986 

4.5 

-5.4477 

I 2.6118 

| 

-5.1123 

0.8790 

5.0 

-5.9323 

j 2.6959 

-5.6643 

0.8524 

These  zeros  were  obtained  by  J.  Herder  of  ECOM's  Math. 
Support  Division  using  a Burroughs  B-5700  and  the  Bessel  routine 
provided  by  M.  Goldstein  of  New  York  University.  The  following 
curves  were  obtained  f "om  the  above  data  by  C.  De  Santis  of  the 
Communications  Research  Tech.  Area. 
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1 . INTRODUCTION.  This  is  a review  of  some  classical  problems  in  laminar 
flame  theory  that  essentially  assumes  no  knowledge  of  combustion  by  the  reader. 

Laminar  flame  theory  is  a branch  of  fluid  mechanics  --  essentially  the  motions 
of  die  gases  in  a flame  are  governed  by  the  compressible  Navier-Stokes  equations  -- 
but  there  are  of  course  some  crucial  features  which  are  not  normally  found  in  classical 
fluid  mechanics.  For  one  tiling  one  is  dealing  with  a mixture  of  different  gases  and  it 
is  necessary  to  say  something  about  changes  in  each  of  the  components  of  the  mixture. 
Secondly,  and  most  important,  there  are  chemical  reactions  so  that  there  is  a source 
or  sink  term  in  the  mass  conservation  equation  for  each  component.  Moreover,  heat 
is  released  by  the  chemical  reactions  so  that  there  is  a source  term  in  the  overall 
energy  equation.  These  chemical  reactions  are  extremely  sensitive  to  temperature  -- 
they  usually  won't  take  place  at  all  if  the  temperature  is  too  low  (which  is  fortunate)  -- 
and  an  essential  feature  of  combustion  that  helps  distinguish  it  from  other  branches  of 
aerothermochemistry  is  that  the  high  temperatures  necessary  to  sustain  the  reactions 
are  generated  by  the  heat  released  by  the  reactions  themselves.  Provided  there  is  an 
adequate  supply  of  fuel  and  oxygen,  combustion  is  a self-sustaining  process. 

There  are  two  different  approaches  to  the  theory  of  combustion  that  one  can  take. 
One  is  to  insist  on  being  as  realistic  as  possible  and  retain  in  the  formulation  of  the 
problem  all  the  complexities  that  might  play  a role  in  practice.  This  of  course  leads 
to  equations  of  remarkable  complexity  which  can  only  be  solved  numerically.  Such  an 
approach  has  its  advocates  (and  is  necessary  if  detailed  quantitative  results  are  needed) 
but  a more  fruitful  approach,  given  the  present  state  of  combustion  science,  is  to  strip 
each  problem  down  to  its  fundamentals  and  write  down  mode]  equations  that  are  clearly 
inappropriate  in  reality  but  nevertheless  contain  the  physical  features  which  are  the 
essence  of  the  problem.  The  hope  is  that  the  equations  are  simple  enough  to  solve 
analytically,  or,  if  recourse  to  a computer  is  still  necessary,  simple  enough  so  that 
useful  information  can  be  extracted  from  the  numbers.  Quantitative  accuracy  is  sacri- 
ficed for  qualitative  understanding. 

Actually  there  is  a third  approach  to  studying  combustion  problems  that  has  been 
quite  popular  but  which  should  be  avoided  if  at  all  possible.  One  starts  by  writing  down 
sensible  model  equations  but  then  constructs  what  might  be  called  'model  solutions'. 

That  is,  solutions  are  constructed  using  ad  hoc  irrational  approximations  and  as  a con- 
sequence one  can  never  be  sure  of  the  significance  of  the  end  results.  It  isn't  clear 
whether  the  features  of  the  solution  are  creatures  of  the  original  model  or  of  the  irra- 
tional approximations.  This  makes  systematic  development  of  the  subject  difficult  and 
has  led  to  spurious  results  in  the  past. 


*This  is  a more  or  less  verbatim  transcript  of  a review  that 
for  oral  presentation,  so  that  the  reader  is  asked  to  forgive 
footnotes  were  not  part  of  the  original  presentation  but  have 

clarity. 
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was  specifically  prepared 
the  colloquial  style.  The 
been  added  for  the  sake  of 


Of  course  it  is  clear  why  such  an  unsatisfactory  approach  has  been  popular  -- 
for  many  years  no  rational  systematic  method  of  solving  the  various  model  equations 
was  known  (although  the  literature  is  replete  with  brilliant  ad  hoc  analyses).  But  in 
recent  years  that  has  changed,  and  it  would  probably  be  fair  to  say  that  there 
has  been  a revolution  in  combustion  theory.  At  the  heart  of  this  revolution  was  the 
realization  that  combustion  theory  lias  its  own  unique  asymptotics  which  can  be  exploited 
using  singular  perturbation  theory.  In  particular,  a combination  of  Damkohler  Number 
asymptotics  and  activation  energy  asymptotics,  where  appropriate,  can  often  lead  to  the 
solution  of  model  equations  that  were  for  many  years  thought  too  difficult  to  solve. 

What  I want  to  do  today  is  briefly  describe  the  nature  of  these  asymptotic 
methods,  concentrating  particularly  on  activation  energy  asymptotics;  describe  the 
mathematical  details  of  a particularly  simple  application  of  activation  energy  asymp- 
totics; and  then  describe  a perturbation  procedure  that  generates  nonlinear  solutions 
for  a variety  of  problems,  including  a certain  class  of  unsteady  problems.  In  no  sense 
am  I going  to  attempt  an  exhaustive  review. 

Let  us  start  by  looking  at  a specific  problem. 


2.  QUASI-STEADY  FUEL  DROP  BURNING 


Oxygen 


r 


Figure  1 represents  an  idealized  model  of  a burning  fuel  drop.  The  situation 
is  assumed  steady  and  spherically  symmetric,  conditions  never  realized  in  practice 
which  emphasizes  that  we  are  examining  a highly  idealized  model. 

The  ball  of  fuel,  in  liquid  form,  is  hot  because  of  the  presence  of  the  flame 
and  as  a consequence  it  evaporates,  mixes  by  diffusion  with  the  surrounding  atmosphere 
of  oxygen,  and  then  this  mixture  burns  within  the  flame.  Appropriate  model  equations 
are,  • 


£Y0  * 


M dY0 


2 dr 


_L  A 

2 dr 


2 dY0 


- D1  Y()  Yp  Ta  exp  ( - Jp  ) 3 - u 


RT 


£ Yp  = -w 
ZT  = Qto  . 

These  equations  are  based  on  the  simple  chemical  kinetic  scheme 
[Fuel]  + [Oxygen]  — >•  [Product]  . 

The  kinetics  of  a real  flame  are  much  more  complicated  than  this  but  nevertheless  the 
simple  model  preserves  three  essential  features  --  oxygen  is  consumed,  fuel  is  con- 
sumed, and  heat  is  generated. 

Looking  at  the  equation  for  YQ,  the  mass  fraction  cf  oxygen,  we  see  that  there 
are  three  terms.  The  first  term  is  amass  transport  term  (there  is  a radial  flux  of 
fuel  and  therefore  a mass-averaged  radial  velocity)  and  M is  a measure  of  the  flux 
of  fuel  leaving  the  surface.  It  can  be  regarded  as  the  fundamental  unknown  of  the 
problem. 


The  second  term  is  a diffusion  term. 


The  third  term,  the  chemical  reaction  term,  simply  indicates  that  the  amount 
of  oxygen  consumed  depends  on  how  much  oxygen  is  present,  how  much  fuel  is  present, 
and  the  temperature  T.  The  most  important  part  of  the  temperature  dependence  is 
the  exponential  factor  --  R is  the  gas  constant  and  E is  a constant  known  as  the  acti- 
vation energy.  E tends  to  be  rather  large  so  that  the  reaction  rate  is  very  sensitive 
to  changes  in  the  temperature. 

Dj  is  a parameter  that  depends  on  a number  of  things  including  the  pressure 
(which  is  uniform)  and  is  known  as  the  Damkohler  Number. 

The  equation  for  Yp  is  identical  to  that  for  Y ,,  a consequence  of  assuming 
equal  diffusion  coefficients.  The  energy  equation  (which  is  an  equation  for  the  temper- 
ature since  the  thermal  energy  is  much  larger  than  the  kinetic  energy)  is  very  similar 
(the  Lewis  number  equals  one)  but  the  reaction  term  appears  with  a positive  sign  since 
heat  is  generated  by  the  reaction,  and  the  amount  of  heat  generated  is  characterized 
by  the  parameter  Q. 


+Kassoy,  D.  R. 


& Williams,  F.  A. 


Physics  of  Fluids,  11,  1343  (1968). 
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There  are  appropriate  boundary  conditions  (which  have  not  been  written  down), 
4 at  the  surface  and  3 at  infinity,  making  a total  of  7.  Since  the  system  is  a sixth 
order  one,  these  conditions  are  sufficient  to  determine  the  three  field  variables  and 
M,  which  is  a measure  of  the  burning  rate. 

There  are  many  different  ways  of  characterizing  the  solution  of  this  problem, 
and  one  way  is  to  plot  the  variation  of  M with  Dp 


Fig.  2.  Burning  Response  for  a Fuel  Drop 


U 
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Figure  2 is  typical  of  the  kind  of  response  one  gets  --an  S shaped  curve, 
and  at  the  risk  of  oversimplification  the  turning  points  are  labelled  as  the  ignition 
point  and  the  extinction  point.  The  reason  for  this  is  that  if  the  response  is  on  the 
lower  branch  of  the  curve,  where  the  burning  is  weak,  and  Dj  is  increased  (by 
increasing  the  pressure,  for  example)  then  the  response  moves  to  the  right  until  the 
ignition  point  is  reached  whereupon  any  further  increase  in  D^  causes  a jump  to  the 
top  branch  where  the  burning  is  strong.  A subsequent  decrease  in  Dt  moves  the 
response  to  the  left  along  the  strong  burning  branch  until  the  extinction  point  is  reached 
where  the  response  drops  back  to  the  weak  burning  branch."1" 


The  oversimplification  stems  from  the  possibility  that  the  response  is  forced  off  of 
one  of  the  branches  by  instability  before  the  turning  point  is  reached.  This  happens 
in  chemical  reactor  theory  where  similar  S-shaped  responses  occur  (Cohen,  D.  S. 
& Poore,  B.  SIAM  J.  Appl.  Math. , 27,  416  (1974). 
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Consider  now  the  ends  of  the  curve.  At  the  left  IH  vanishes,  whence  go 
vanishes  and  the  equations  reduce  to  linear  equations  which  can  be  easily  solved.  This 
so  called  frozen  limit  is  of  little  interest  since  there  is  no  combustion. 


The  right  hand  end  (D.  — > oo)  is  much  more  important  since  ty  :al  flames  en- 
countered in  everyday  life  often  have  very  large  Damkohler  numbers,  'he  limit  (called 
the  equilibrium  limit)  is  a singular  one  in  which  the  coefficient  of  tjhe  ghest  derivative 
vanishes,  and  so  thin  layers  (boundary  layers  or  Interior  layers  ) _an  occur.  Outside 
of  these  layers  it  is  apparent,  since  w must  be  finite,  that  as  — > oo  , 


YoV 


0, 


and  so  Yq  and/or  Yp  must  vanish,  w is  then  the  product  of  something  that  goes  to 
infinity  times  something  that  goes  to  zero  and  it  is  clear  from  the  equations  (the  equa- 
tion for  Yq  when  Yq  vanishes)  that  this  product  vanishes  in  the  limit.  In  this  sense 
there  are  similarities  between  the  equilibrium  limit  and  the  frozen  limit,  but  the 
possibility  of  thin  layers  in  the  former  case  is  a crucial  difference. 


Important  though  Damkohler  Number  asymptotics  may  be,  it  obviously  cannot 
tell  us  anything  about  ignition  or  extinction,  so  that  if  we  wish  to  bridge  the  gap  between 
Dj  = 0 and  Dj— » oo  a different  approach  is  necessary.  Activation  energy  asymptotics 
is  an  appropriate  tool.  More  precisely  we  consider  the  solution  of  the  equations  when 


‘CD 


where  Tref  is  same  reference  temperature.  This  is  a realistic  limit  in  many  com- 
bustion situations,  it  can  be  used  to  solve  many  important  combustion  problems, 
and  it  is  mathematically  interesting  because  the  large  parameter  appears  in  an  uncon- 
ventional fashion,  as  the  argument  of  an  exponential. 

One  thing  that  is  immediately  clear  is  that  we  can  not  just  put  E = oo  without 
doing  anything  else  since  that  just  yields  the  frozen  limit  (w  = 0).  Bear  in  mind  that 
we  want  to  determine  how  the  response  changes  with  D^,  and  the  above  observation 
implies  that  only  when  is  very  large  can  we  get  away  from  the  frozen  limit.  What 
we  have  to  do  is  write 

Dx  = exp  (E/RT*) 

where  T*  is  a temperature  that  characterizes  the  magnitude  of  so  that 

r E , 1 1 . . 

w ® exp  [j^  ( 

and  then  the  behavior  of  w in  the  limit  E — > oo  depends  upon  the  relative  magnitudes 
of  T and  T*.  There  are  three  possibilities. 

(i)  In  regions  where  T > T*  the  exponential  goes  to  infinity  in  the  limit,  so  that 

Y Y_  — * 0,  « —*  0,  corresponding  to  equilibrium. 
u F 


+See  Buckmaster,  J.  D.  Combustion  and  Flame,  2A,  79  (1975). 
| | 

Thin  layers  can  occur  in  such  regions  of  course. 
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(ii)  In  regions  where  T < T*  the  exponential  vanishes  so  that  u — ► 0,  a frozen 
situation. 

T-T 

(iii)  Finally,  in  transition  regions  where  T T*  (more  precisely,  * = 

RT  * * 

O ( - -p---  ) ) the  exponential  can  be  simplified  slightly, 


r E (T-T*)  , 
to  oc  exp  [ fi-  ] , 

T* 

but  the  important  point  is  that  to  does  not  vanish  so  that  such  a region  is  a reaction 
zone.  Reaction  zones  are  often  thin  (but  not  necessarily  so)  in  which  case  they  are 
called  flame  sheets. 


Application  of  activation  energy  asymptotics  to  a steady  one-dimensional  prob- 
lem such  as  the  fuel  drop  problem  requires,  in  general,  the  construction  of  solutions 
in  the  three  different  kinds  of  regions  and  matching  them  in  the  usual  way  (that  is,  in 
the  sense  of  matched  asymptotic  expansions).  Usually,  the  most  difficult  part  of  this 
procedure  is  deciding  what  regions  are  needed  and  where  they  are  located.  As  an 
example,  if  we  ask  what  is  the  nature  of  the  solution  for  a point  on  the  middle  branch 
of  the  S-shaped  response  (Fig.  2),  it  turns  out  that  T*  is  the  maximum  temperature. 
That  is,  at  some  finite  value  of  r the  temperature  is  equal  to  T*  so  that  all  the 
reaction  occurs  in  a thin  flame  sheet  located  there,  and  on  either  side  of  the  sheet  the 
combustion  is  frozen  (Fig.  3). 


Flame -Sheet 


Surface 


Fig.  3. 


Typical  Temperature  Distr  ibution  for  a 
Solution  on  the  Middle  Branch 


In  the  frozen  regions  solutions  can  easily  be  constructed  of  the  linear  governing  equa- 
tions. In  the  flame  sheet  the  description  is  nonlinear,  but  because  the  sheet  is  thin 
the  equations  are  simplified.  Matching  the  flame  sheet  solution  with  the  solutions  in 
the  frozen  regions  ultimately  leads  to  the  complete  solution  of  the  problem  and  in 
particular,  the  determination  of  the  burning  rate  M\  A remark  about  the  nature  of  the 
solution  on  the  other  two  branches  will  be  made  later. 


3.  PREMIXED  FLAMES.  The  fuel  drop  problem  is  an  example  of  what  is  known 
as  a diffusion  flame.  There  are  other  kinds  of  flames  in  which  the  reactants  are  sup- 
plied as  a homogeneous  mixture  which  merely  needs  to  be  raised  to  an  adequate  tem- 
perature to  initiate  burning.  Such  flames  are  called  premixed  flames,  a common 
example  being  the  inner  cone  of  a bunsen  burner  flame  (observed  when  the  air  hole 
is  open  which  permits  oxygen  to  mix  with  the  gas  as  it  passes  up  the  tube). 

If  a match  is  applied  to  such  a mixture,  confined  within  a tube,  the  mixture 
will  burn  and  a flame  will  travel  down  the  tube  consuming  the  mixture  as  it  goes. 

Under  ideal  conditions  this  flame  travels  as  a progressive  wave  with  a more  or  less 
well  defined  wave  speed,  and  one  of  the  classical  problems  of  laminar  flame  theory  is 
to  determine  that  wave  or  flame  speed.  What  I want  to  do  now  is  briefly  describe  how 
this  can  be  done  using  activation  ^ergy  asymptotics,  since  this  is  one  of  the  simplest 
nontrivial  applications  of  activati*  energy  asymptotics  presently  known. 

For  a premixed  flame  * # mplest  kind  of  sensible  chemical  kinetic  scheme  is 

[Mixture]  =4>[  Product] 


at  a rate 


w = BY  exp  (-E/RT) 


where  Y is  the  mass  fraction  of  mixture  (a  preexponential  temperature  dependence 
like  the  Ta  that  was  included  in  the  fuel  drop  equations  could  be  inserted  without 
essentially  changing  the  subsequent  discussion). 

The  flame  is  assumed  to  be  one-dimensional  and  the  situation  in  a flame-fixed 
frame  is  shown  in  Fig.  4. 


Hot  Product 


Cold  Mixture 


Y=0  , T=T, 


Y=1  , T=T 

7 a 


> Stationary  Flame 

Fig.  4.  The  One-Dimensional  Premixed  Flame 


The  work  of  A.  Linan,  Astronautics  Acta,  1,  1007  (1974)  on  the  counterflow  diffusion 
flame  provides  an  exhaustive  description  of  calculations  of  this  kind.  Kapila  A.  K. , 
Ludford,  G.  S.  S.  & Buckmaster,  J.  D.  Combustion  and  Flame,  25,  361  (1975) 
describe  similar  calculations  for  a spherical  premixed  flame. 


Cold  mixture  comes  in  from  the  right  and  passes  through  the  flame  where  it 
is  burnt  and  emerges  as  hot  product  on  the  left.  The  reaction  only  stops  when  all  the 
mixture  is  consumed  so  that  Y = 0 on  the  left  and  the  temperature  ther«i*Tj  (>  T^), 
the  so  called  adiabatic  flame  temperature. 

Appropriate  model  equations  are 

nr  * s > D §>  - ^ exP  <-E/RT) 
pv  CP  ^ = s <x  £ > + Q B Y exp  <-E/RT) 

pv  = - m (constant) 
pT  = constant 

which  are  similar,  in  many  respects,  to  the  fuel  drop  equations  written  down  earlier, 
m the  constant  mass  flux,  is  the  fundamental  unknown  being  essentially  the  flame 
speed.  The  equation  of  state  is  simply  a statement  that  the  pressure  is  constant,  valid 
for  low  Mach  Number  flames. 


The  flame  temperature  T^  can  be  determined  without  solving  this  system, 
for  in  passing  through  the  flame  we  know  exactly  how  much  iaf  the  mixture  is  con- 
sumed (all  of  it)  and  we  know  exactly  how  much  heat  is  release?  per  unit  of  mixture 
consumed  (Q).  Therefore  an  overall  energy  balance  requires  \ 


WT«,>  = Q- 

The  system,  when  appropriately  ncn-dimensionalized,  is 
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and  the  essential  idea  is  that  this  system  only  has  a solution  for  a unique*  choice  of 
the  parameter  — 2^—  and  so  in  this  way  the  flame  speed  can  be  determined. 

” S 

An  enormous  amount  of  ingenious  effort  has  been  expended  over  the  years  on 
the  solution  of  this  problem,  and  literally  dozens  of  approximate  solutions  can  be 
found  in  the  literature  each  purporting  to  be  simpler  or  more  accurate  than  earlier 
attempts.  Most  of  this  work  was  rendered  obsolete  in  1970  by  Bush  and  Fendell++  who 
showed  how  the  problem  can  be  solved  rationally  in  the  limit  of  infinite  activation 
energy  (9  — > oo). 


Just  as  for  the  fuel  drop  problem  we  can  not  just  put  0 = oo  in  the  equations  -- 
BA 

it  is  necessary  to  let  — * >co  at  the  same  time.  More  precisely  we  write 

m2cp 


5L-T  92  exp  [- $2  = 0(1) 

L(1+4>00)  *00 


a choice  partly  motivated  by  the  observation  that  we  would  expect,  on  physical  grounds, 
that  the  flame  temperature  (l+^d,)  is  the  maximum  temperature  and  moreover  that  4> 
increases  monotonically  from  4>oq  to  (1  +4>00)  as  the  flame  is  traversed.  Be  that 
as  it  may,  the  problem  is  to  find  $2. 


The  reaction  rate  w is  proportional  to 


exp  [ 


i+4> 

oo 


so  that  wherever  4>  is  less  than  the  flame  temperature  the  reaction  is  frozen  and  the 
governing  equations  are 


d2* 


d<f> 

df 


0 


d2y  + I dY 

^ 


0 


*In  actual  fact  the  system  as  written  doesn't  have  a solution  at  all,  since  the  upstream 
state  Y = 1,  4>  = 4>  is  not  a solution  of  the  equations  (the  so-called  cold  boundary 
difficulty).  The  problem  arises  because  Lhe  temperature  dependence  of  the  reaction 
rate  is  not  accurately  modelled  by  exp  (-0/4>)  when  <(>  is  small.  A realistic  resolu- 
tion of  the  difficulty  is  to  introduce  a cutoff  temperature  lying  between  ^ and  1+4^ 
below  which  the  reaction  rate  is  identically  zero.  No  specific  choice  for  this  tem- 
perature is  needed  when  the  activation  energy  0 is  large,  as  the  subsequent  analysis 
shows. 

^Bush,  W.  B.  & Fendell,  F.  E.  Combustion  Science  & Technology,  J_,  421  (1970). 
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with  elementary  solutions . The  location  of  the  origin  of  coordinates  can  be  chosen 
so  that  these  equations  are  valid  in  £ > 0. 

Noting  that 

Y=0,  4>=i+4>oo 

is  an  exact  solution  of  the  complete  equations,  the  large  scale  structure  of  the  flame 
is  obtained  by  piecing  together  this  exact  solution  and  appropriate  solutions  of  the 
frozen  equations,  as  shown  in  Fig.  5. 


Fig.  5.  Large  Scale  Structure  of  the  Flame 


The  frozen  solutions  in  | > 0 are  chosen  to  ensure  that  the  boundary  conditions 
as  £ ->  oo  are  satisfied  and  that  <(>  and  Y are  continuous  at  the  origin. 

To  complete  the  solution  it  is  necessary  to  analyze  the  thin  region  near  the  origin 
where  the  derivatives  are  smoothed  out.  The  chemical  reaction  is  confined  to  this  re- 
gion, which  is  therefore  a flame  sheet,  and  the  local  solution  has  the  form 


4>~  (!  + + 5 (1  +<t>00)2  0(0  + O(-^j) 

0 

y ~ e y(0+  0(4) 
i * 5<1+V2'- 

In  other  words,  the  flame  sheet  has  a thickness  of  order  O ( i ) but  gradients  there 
are  0(1).  6 


192 


" | 


The  perturbation  quantities  satisfy 

= 4 f — 3-rl  = - nye^—  ‘ 

d C c Mi1  l L (l+4> m r J L (1+4>ot) 

It  follows  that 

y + L (l+'J^)2  <P 

is  a linear  function  Z(i).  But  then  matching  (both  y and  ip  vanish  as  S.-*  - as  to 
match  with  the  solution  behind  the  flame  sheet)  implies  that  £ is  identically  zero.  A 
problem  for  ip  alone  may  then  be  formulated. 

0 = it  - ate* 

dr 

as  l — > - 00  tp  — > 0 


The  latter  boundary  condition  arises  from  matching  ahead  of  the  flame  sheet  (Fig.  5). 
Integrating  once, 

0 = ( ^ )2  - 2 a (*  e^  - e^  + 1) 

and  then  applying  the  condition  as  1— > 00  leads  to  Bush  and  Fendell's  result 


and  completes  the  determination  of  the  flame  speed. 


4.  THE  MODIFIED  PREMIXED  FLAME.  There  are  two  features  of  Bush  and 
Fendell's  solution  that  I want  to  emphasize.  First  of  all,  one  of  the  reasons  that  the 
analysis  is  so  simple  is  that  the  chemistry  free  equations  can  be  so  easily  solved.  This 
suggests  the  following  question.  Suppose  that  we  are  concerned  with  a more  complicated 
problem,  one  related  to  the  one-dimensional  premixed  flame  but  whose  description  re- 
quires additional  terms  in  the  governing  equations.  What  additional  terms  would  lead 
to  chemistry -free  equations  as  easy  to  solve  as  Bush  and  Fendell's?  Such  a question 
obviously  does  not  have  a unique  answer,  but  one  possibility  is 
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A + S = i f (i,  4>,  y,  — > 

de2  d£  0 


g(e,  4»,  y,  — ) 


where  f and  g are  quite  arbitrary.  Perturbation  solutions  of  these  equations  can 
easily  be  constructed. 

The  second  thing  to  notice  about  Bush  and  Fendell's  solution  is  that  •'he  flame 
speed  is  extremely  sensitive  to  the  value  of  the  maximum  temperature.  The  expression 
for  the  flame  speed  (essentially  m)  is 


2L  <!-%,) 


r W _ 

exp  [^] 


and  it  is  clear  that  small  changes  in  the  flame  temperature  (1+<J>  ) will  generate  large 
changes  in  the  flame  speed.  Order  0 (1/9)  changes  in  temperature  are  sufficient  to 
generate  0 (1)  changes  in  the  speed,  for  example.  The  significance  of  the  modified 
equations  written  down  above  is  that  we  might  expect  that  the  0 (1/0)  perturbation 
terms  can  generate  0 (1/0)  changes  in  the  maximum  temperature  and  thus  lead  to  solu- 
tions quite  different  from  Bush  and  Fendell's.  And  yet  we  would  not  expect  the  inclusion 
of  these  terms  to  unduly  complicate  the  analysis. 

Let  us  consider  a simple  example. 

5.  EFFECT  OF  HEAT  LOSSES.  In  any  real  flame  there  are  heat  losses  due 
radiation  or  conduction  to  adjacent  boundaries.  In  a one-dimensional  formulation  these 
losses  can  be  modelled  by  adding  a term  -K  (T-T  )+,  K = constant,  to  the  energy 
equation  so  that  00 

pvCP®  - K<T'V  +QBY  exp ( " llr * • 

The  extra  term  tends  to  drive  the  temperature  to  the  reservoir  value,  and  the  modified 
equations  are  of  the  type  discussed  above  provided  the  magnitude  of  K is  such  that  the 
non-dimensional  term  is  O (1/0). 

The  analysis  is  more  complicated  than  Bush  and  Fendell  but  no  new  principles 
are  involved,  and  defining 


Quite  general  functions  of  T can  in  fact  be  handled  by  the  analysis,  see  Buckmaster, 
J.  D.  Combustion  & Flame  (in  press). 


r 


H = Flame  Speed/Adiabatic  Flame  Speed 


we  find 

(1+4  )2  H2  In  H + K'  =0 
' oo 

where  K'  is  K non-dimensionalized. 

When  K'  vanishes  there  are  two  solutions,  H = 1 (Bush  and  Fendell’s  result) 
and  H = 0,  and  for  moderate  values  of  K'  there  are  two  solutions,  but  if  K’  is  too 
large  there  are  no  solutions  (Fig.  6).  This  principle  has  been  known  for  many  years 


Flame  Speed  vs  Heat  Losses 


and  is  the  foundation  of  the  miner’s  safety  lamp  invented  by  Humphrey  Davy  in  the 
early  19th  century.  The  safety  lamp  consists  of  a naked  flame  surrounded  by  a wire 
gauze  cage,  and  if  this  is  carried  into  a combustible  atmosphere,  the  latter  passes 
through  the  gauze  and  burns  on  contact  with  the  flame.  Without  the  gauze  cage  the 
flame  would  spread  through  the  surrounding  atmosphere,  usually  in  a violent  (explo- 
sive) fashion,  but  the  gauze  is  such  an  efficient  conductor  of  heat  that  the  flame  can  not 
pass  through  it.  Thus  the  miner,  on  seeing  the  flame  flare  up,  can  safely  retreat. 

Looking  again  at  the  response  diagram  (Fig.  6),  recall  that  as  we  move  around 
the  curve  the  maximum  temperature  changes  by  only  an  0(1/0)  amount.  Returning 
to  the  fuel  drop  response  (Fig.  2),  the  top  branch  of  the  curve,  including  the  extinction 
point,  corresponds  to  solutions  for  which  the  maximum  temperature  differs  by  only  an 


Bush  and  Fendell’s  result. 
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0(1/0)  amount  from  the  maximum  temperature  in  the  equilibrium  (D.  — > co)  limit. 

The  lower  branch,  including  the  ignition  point,  corresponds  to  solutions  for  which  the 
maximum  temperature  differs  by  only  an  O (1/0)  amount  from  the  maximum  temper- 
ature in  the  frozen  (D^  = 0)  limit.  Thus  there  is  an  analogy  between  the  C-shaped 
quenching  curve  of  Fig.  6,  and  the  C-shaped  extinction  and  ignition  curves  of  Fig.  2. 

Once  the  idea  of  adding  O (1/0)  perturbation  terms  to  systems  of  flame  equa- 
tions and  looking  for  solutions  tliat  differ  by  an  O (1)  amount  from  the  unperturbed 
solution  is  understood,  there  are  an  infinite  number  of  problems  that  one  can  examine. 
>ne  is  limited  only  by  one's  imagination  in  conjuring  up  different  kinds  of  perturba- 
tions, and  of  course  any  flame  can  be  perturbed,  not  just  the  one-dimensional  premixed 
flame.  Let  us  consider  some  unsteady  examples. 

6.  UNSTEADY  ONE-DIMENSIONAL  PREMIXED  FLAME.  Consider  the  unsteady 
form  of  Bush  and  Fendell's  problem,  for  which  the  equations  are: 

p If  + p v H = 4 <p D ' BY  exP  <-  r!> 
pCp!f  +pvCpH^(0  +QBY  «p<-]gr> 


If  + wa (p  v)  = 0 


P T = constant. 


These  differ  from  the  earlier  equations  only  by  the  addition  of  the  time  derivatives. 
Now  the  steady  flame  has  a characteristic  thickness, 


There  is  a characteristic  velocity,  the  flame  speed, 


and  so  we  can  define  a characteristic  time 


If  we  try  to  solve  the  unsteady  equations  --  as  an  initial  value  problem,  for  example  — 
then  most  disturbances  will  change  on  this  time  scale  and  will  be  governed  by  the  com- 
plete system  of  equations,  without  simplification.  Even  without  chemistry  this  system 
presents  a formidable  challenge.  However,  it  is  conceivable  that  there  are  disturbances 
that  change  on  the  much  larger  time  scale 
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in  which  case  the  time  derivatives  are  O ( i ) terms  and  so  can  be  handled  in  the 
same  way  as  the  small  heat  loss  term.  Indeed  we  find 


2 (l  +4>  )2  H3  In  H +b  ^ = 0 

oo  d T 

<\n  equation  first  derived  by  Sivashinsky.  + Here  j is  time,  H is  the  flame  speed 
ratio  as  before,  and  b is  a parameter  that  depends  upon  the  Lewis  Number  L. 

b < 0 if  L > 1 

b > 0 if  L < 1 


b = 0 if  L = 1 . 

Apparently,  when  L = l^there  are  no  disturbances  that  change  on  the  slew  time 
scale,  an  atypical  situation.  It  is  tempting  when  solving  combustion  problems  to  choose 
L = 1,  since  this  often  leads  to  mathematical  simplification  (the  steady  one-dimensional 
premixed  flame  then  has  unifo^n  enthalpy,  for  example)  but  this  temptation  is  appar- 
ently something  that  should  be  resisted,  at  least  when  dealing  with  unsteady  problems. 

There  are  two  possible  steady  solutions 

H = 0,  H = 1 , 

and  the  stability  of  these  solutions  depends  upon  the  sign  of  b: 

b>0(L<l)  H = 1 stable,  H = 0 unstable, 

b<0(L>l)  H = 1 unstable,  H = 0 stable. 

Thus  if  L > 1,  Bush  and  Fendell's  solution  for  the  one-dimensional  flame  is  unstable. 
It  should  be  emphasized,  of  course,  that  only  the  predictions  of  instability  are  signifi- 
cant. A flame  that  is  stable  to  the  kind  of  disturbances  that  we  have  considered  here 
might  well  be  unstable  to  other  kinds  of  disturbances. 


7.  UNSTEADY  FLAME  WITH  HEAT  LOSSES.  As  we  saw  earlier,  when  there 
are  heat  losses  the  burning  response  is  multiple  valued.  Thus  it  is  of  interest  to  add 
heat  losses  to  the  unsteady  formulation  in  the  hope  of  gaining  insight  into  the  significance 
of  multivalued  responses.  The  result  is 

2 (l+<b  )2  H3  In  H + 2HK'  + b = 0 . 

CD  QT 

Note  that  in  addition  to  the  two  steady  branches  shown  earlier  in  Fig.  6,  there  is  a 
third  steady  solution  H = 0 (Fig.  7). 


197 


Fig.  7.  Stability  of  a Flame  with  Heat  Losses  when  L < 1 


Figure  7 shows  stability  arrows  appropriate  when  L < 1.  These  indicate  the  direction 
the  solution  will  be  driven  in  an  unsteady  situation.  Thus  when  L < 1 the  branches 
AB  and  CD  are  stable,  whereas  CB  is  unstable.  For  L > 1 the  arrows  must  be 
reserved. 


8.  THREE-DIMENSIONAL  UNSTEADY  FLAMES.  The  perturbation  procedure 
is  not  confined  to  one-dimensional  flames.  Three-dimensional  disturbances  can  also 
be  treated  provided  their  nature  is  such  that  the  three-dimensional  terms  are  essentially 
0(1/9).  The  equations  are  rather  more  complicated  since  the  velocity  field  must  be 
determined  and  this  requires  solution  of  the  momentum  equation 

+(v.V)v  = -iVp+v  [V2v+3  V(V  • v)] 
in  addition  to  the  previous  equations. 


Permissable  disturbances  are  defined  in  Fig.  8 (recall  that  the  flame  thickness 


Fig.  8.  Allowable  Three-Dimensional  Disturbances 


The  time  scale  is  the  same  long  time  scale  as  before. 
The  result  for  the  flame  speed  H is+ 

<1+V2  + (j*)2) 

+ 4 g + b(A  +9^,  = 0, 
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This  is  actually  a limiting  result  only  valid  when  the  heat  released  by  the  reaction 
Q is  small  compared  to  the  enthalpy  of  the  unburnt  mixture.  In  general  a single 
equation  governing  the  flame  speed  can  not  be  written  down  when  there  are  three- 
dimensional  disturbances.  Nevertheless,  many  of  the  qualitative  features  of  the 
general  result  are  the  same  as  those  of  the  limiting  result.  The  details  are  in 
Buckmaster,  J.  D.  Combustion  & Flame  (to  appear). 


If  we  look  for  perturbations  of  the  one-dimensional  steady  flame  of  the  form 


X = t + 6 eflT  f (n,t)  > 6 <<  1 


a"f  a2f 

— 2 + — 2 
3*1  3 1 


+ k2  f 


0 


then 


a = 2E  f ' 2 <1+4>oo)2  - V^0+^oo)4+  4b2  fc2  ] * 

If  k vanishes,  the  quantity  in  square  brackets  is  either  zero  or  negative  so  that  we 
recover  the  earlier  result  that  the  flame  is  unstable  if  b < 0 (L  > 1).  But  if  k 4 0 
there  is  a positive  root  irrespective  of  the  sign  of  b,  so  that  the  one-dimensional  flame 
is  also  unstable  if  L < 1. 


The  problem  of  flame  instability  is  an  interesting  and  a complicated  one.  Exper- 
iment suggests  that  sometimes  instability  destroys  a flame,  sometimes  it  merely  causes 
it  to  flicker,  and  sometimes  bifurcation  occurs+.  Most  of  these  observations  are  pres- 
ently unexplained  but  it  is  possible  that  the  above  results  will  play  a role  in  throwing 
light  on  some  of  these  phenomena.  In  general  we  can  expect  activation  energy  asymp- 
totics to  contribute  significantly  to  our  understanding  of  flame  instability.  For  example, 
Matkowsky  and  Sivashinsky"^"  claim  to  have  explained  cellular  flames  in  this  way. 

I shall  conclude  by  making  some  additional  remarks  about  the  long  time  scale 
that  plays  such  an  important  role  in  the  unsteady  problems  discussed  above.  The  point 
is  best  illustrated  by  considering  a specific  problem. 


9.  SOLID  DEFLAGRATION.  The  burning  of  a solid  is  of  fundamental  interest 
in  the  theory  of  solid  propellant  rocket  motors,  and  Fig.  9 shows  a classical  one- 
dimensional model.  The  solid  is  hot,  because  of  the  proximity  of  the  flame,  and  gives 
off  a combustible  mixture  which  burns  within  the  flame.  The  flame  is  propagating  to 
the  left  relative  to  the  gases  but  the  gases  are  moving  to  the  right  and  in  the  steady 
state  the  flame  is  stationary  relative  to  the  solid.  The  burning  rate  depends  upon  the 
pressure  and  a classical  problem  is  the  determination  of  the  steady  state  burning  rate. 

The  flame  is  essentially  the  same  as  that  analyzed  by  Bush  and  Fendell.  There 
are  differences  in  the  problems,  of  course,  owing  to  the  different  boundary  conditions, 
and  a solution  of  the  heat  conduction  equation  has  to  be  constructed  in  the  solid  (which 
is  being  fed  to  the  right  in  a flame-fixed  frame)  but  the  analysis  is  straightforward  and 
the  results  have  some  connection  with  experimental  reality. 


+At  this  point  Fig.  D.  1 (p.78),  Fig.  D.  11  (p.  86)  and  Fig.  D.  10  (p.85  ) from  Markstein, 
G.  H.  Non-Steady  Flame  Propagation  Agardograph  No.  75,  Macmillan,  New  York, 
1964,  were  shown. 

f | 

Private  communication. 

+++See  Buckmaster,  J.  D. , Kapila,  A.  K. , & Ludford,  G.  S.  S.  Astronautica  Acta 
(to  appear). 
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Solid 


Fig.  9.  Burning  Solid 


A more  complicated  problem  is  one  for  which  the  pressure  varies  with  time. 
This  also  is  of  interest  in  the  study  of  solid  propellant  rocket  motors  since  such  motors 
are  often  violently  unstable.  Now  if  the  pressure  varies  very  slowly  with  time,  it  is 
apparent  that  the  response  will  be  quasi- steady.  That  is,  the  burning  rate  will  be  the 
steady  state  value  corresponding  to  the  instantaneous  value  of  the  pressure.  The  ques- 
tion then  arises:  What  is  the  slowest  variation  in  pressure  for  which  there  will  be  a 
significant  lag  in  the  burning  response  and  therefore  significant  transient  effects?  The 
answer  is  pressures  that  vary  on  the  long  time  scale 


t 


9 X p 

O ( — — 
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for  these  will  excite  the  slowly  varying  disturbances.  Indeed,  if  the  appropriate  analy- 
sis is  carried  out  we  find 


r dH 
1 dr 


+ C2  H + C3 


dp 

dr 


+ c4  p = 0 


where  H is  the  burning  rate,  p the  pressure,  and  the  C j are  constants.  The  analy- 
sis is  inherently  a nonlinear  one  but  this  is  the  result  for  infinitesimal  pressure 
variations. 


Flames  are  often  subject  to  external  stimuli  that  change  with  time  and  what  this 
example  suggests  is  that  provided  the  steady  state  solution  is  known,  the  unsteady  prob- 
lem can  be  solved  and  nontrivial  transient  effects  obtained  provided  the  stimulus  changes 
on  the  long  time  scale.  This  could  have  application  to  a variety  of  important  problems. 
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A MODEL  FOR  SHOCK  INDUCED  STRUCTURAL  TRANSFORMATIONS 

Paul  Harris 

Concepts  and  Effectiveness  Division 
Nuclear  Development  and  Engineering  Directorate 
Picatinny  Arsenal 
Dover,  New  Jersey  07801 

ABSTRACT.  The  problem  of  strain  propagation  in  a medium  of  time 
and  strain  (energy)  dependent  elastic  constants  is  considered.  For 
the  elastic  constant  model  considered,  analytic  and  finite  difference 
approximations  appear  to  predict  avalanching  of  the  particle  velocity 
in  a manner  consistent  with  a dynamic  strain  induced  exothermic 
structural  transformation.  The  application  to  enhancement  of  laser 
interaction  with  aerospace  materials  is  discussed. 

1.  INTRODUCTION.  Recent  years  have  seen  increasing  military 
interest  in  the  interaction  of  high  power  optical  signals  (lasers) 
with  aerospace  materials.  A problem  of  particular  interest  has  been 
the  generation  of  a shock  in  an  irradiated  material  in  order  to  pro- 
duce a dynamic  mechanical  deformation  in  an  adjacent  material.  The 
figure  below  illustrates  the  geometry  of  the  problem. 
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The  shock  receiving  medium  could  be  an  explosive,  in  which  case  the 
hardware  application  might  be  a detonator  or  an  explosive  switch. 

For  the  above  type  of  problem  one  would  obviously  like  to  choose 
the  medium  for  shock  generation  so  as  to  maximize  the  generated  shock 
amplitude.  There  are  essentially  two  ways  in  which  the  shock  amplitude 
can  be  maximized  for  a given  optical  signal:  one  can  maximize  the 

strength  of  the  laser  material  interaction,  or  one  can  attempt  to  find 
a generation  medium  which  can  act  as  an  amplifier  of  shock  amplitude 
(the  shock  being  produced  in  approximately  the  electromagnetic  skin 
depth  of  the  generation  medium).  In  this  paper  we  will  mainly  con- 
sider some  mathematical  aspects  of  the  second  approach. 

2.  MATERIAL  SELECTION  AND  PROPERTIES.  Some  alloys  exhibit 
"anomalously"  large  Griineisen  parameters  as  they  undergo  structural 
"phase"  transformations.  Typical  alloy  examples  1,2  are  TiNi  and 
KTaO,.  The  Griineisen  parameter  (proportional  to  the  thermal  expansion 
coefficient)  is  a measure  of  the  pressure  change  caused  by  a change  in 
thermal  energy  density  under  constant  volume  conditions.  Since,  in  the 
absence  of  vaporization  effects,  the  laser  interaction  serves  to  deposit 
thermal  energy  in  the  skin  depth  region,  an  enhanced  Gruneisen  parameter 
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IS  equivalent  to  an  enhanced  pressure  (shock)  amplitude. 
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TiNi  is  an  appropriate  shock  generation  medium  because  its  metallic 
properties,  even  in  the  absence  of  Griineisen  effects,  serve  to  produce 
a small  skin  depth  and  thus  a strong  laser-material  interaction.  The 
observed  1 Griineisen  parameter  enhancement  by  a factor  of  approximately 
twenty  during  the  near  room  temperature  (martensitic)  structural  trans- 
formation  promises  enhancement  of  an  already  strong  laser-material  0 
interaction.  The  practical  limitation  on  the  above  concept  is  a 10  C 
half  maximum  width  for  the  spike  in  the  Gruneisen  parameter,  and  that 
10° C temperature  rise  represents  a rather  small  thermal  energy  density 
deposition. 

The  physics  which  gives  rise  to  the  enhanced  Gruneisen  parameter 
also  results  in  exothermic  (or  endothermic)  effects,  and  different 
elastic  constants  on  each  side  of  the  transition.  While  the  observed3 
exothermicity  of  approximately  6 Cal/gm  is  not  large,  when  combined 
with  the  observed  1 (approximate)  ten  percent  change  in  elastic 
constants,  one  has  a material  which  promises  interesting  thermo-  3,4 
mechanical  effects.  That  interest  is  further  raised  by  the  knowledge 
that  an  applied  strain  can  trigger  the  transformation. 

We  thus  have  a scenario  in  which  a propagating  strain  wave  (shock) 
can  trigger  a structural  transformation,  and  thus  be  amplified  in  the 
process.  It  is  that  secnario  which  we  will  now  model  and  treat  below. 

3.  STRAIN  PROPAGATION  IN  A TRANSFORMING  MEDIUM.  While  there 
exists  a number  2,5  of  elegant  approaches  to  the  physics  of  structural 
phase  transitions,  those  approaches  do  not  yet  appear  capable  of  treat- 
ing the  propagating  strain  condition  of  interest  here.  We  thus  proceed 
somewhat  intuitively. 

Consider  a one-dimensional  strain  problem  (particle  displacement 
only  in  the  direction  of  strain  propagation)  characterized  by 


Po 


(1) 


C = C + a (c,  - C ) 

0 10, 


(2) 


where  p is  mass  density,  u is  particle  displacement,  c is  an  elastic 
constant,  f denotes  a viscosity  functional,  the  subscript  zero  denotes 
the  undisturbed  (prestrain  and  pretransformation)  medium,  the  subscript 
one  dentoes  a final  state  (transformed)  parameter,  and  a is  dependent 
upon  the  degree  of  transformation. 


where  u>  is  strain  energy,  W a constant,  and  t is  a transformation 
incubation  time.  Thus,  to  first  order  in  (u>t) 
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We  will  set  f = o even  though  it  is  known  1 that  attenuation  is  very 
strong  in  the  presence  of  phase  transformations.  We  will  thus  have  to 
keep  in  mind  that  any  u(x,t)  solutions  could  in  practice  be  of  con- 
siderably reduced  amplitude. 

We  will  now  consider  two  approximations  to  Eq.  (5).  The  first  will 
be  relatively  unphysical,  but  analytically  neat.  The  second  will  in- 
volve the  full  form  of  Eq.  (5),  but  will  involve  a rough  finite 
difference  approach. 

APPROXIMATION  I:  We  consider 


P 32U  /,  , 32u 
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0,  6 = const. 


Separating  variables  with  u(x,t)  = T(t)  X (x)  gives 


>o  1 32T  = _ m2  = _0  32X  . 
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If  * ^ yT  = 0>  y-(1  + 6t)- 


Eq.  (9)  is  Airy's  equation  and  its  solutions  can  be  written  as6 
V‘)  ’ AmUl<Y-1>  + BmU2 


where 


m2  1/3 

V-fp  (1  + 6t), 
roc 


with  Uj  and  being  linearly  independent  and  tabulated6. 


We  now  let  k = 2tt/a  = m/ycT  and  evaluate  m2/pQe2.  For  Cj  = cJ2 

and  a)  = W (i.e.  the  strain  energy  taken  equal  to  its  critical  trans- 
formation value) 
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v2  (4x2) , 


(12) 


where  v is  the  velocity  of  sound  in  the  preshocked  medium,  A is  the 
wavelength  of  the  applied  strain  disturbance,  and  c^  = cq/2  corresp 

to  an  exaggeration  of  the  transition  (exothermic)  from  TiNi  (II)  to 
TiNi  (III).  And  using  2nv  = o^x,  where  is  the  angular  frequency 

of  the  applied  disturbance, 

m2 


p jF  = (2V)2 


(13) 


Thus  Y becomes 


Y = (2V)2/3  (1  - §7  )• 


(14) 


For  a particular  we  can  drop  the  subscript  m in  Eq.  (10)  and  write 
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+ BUr, 


(2%x)Zn,  1 j , 


(15) 


J 


and  3u(o,t) 


3t 


A \2/3  ./ 

-“27(2wot)  U1 


19  l2/3  1 

(Zu0tJ  , 1 


t=0 


B (0  <2/3  ' 

27(2V>  U2 


,,  \ V*  1 

2uqT  » 1 


If  we  now  make  the  typical  "hydrodynamic"  approximation  of  u)qt  «1, 
then  from  Eqs.  (15)  and  (16) 

A = u(o,o). 


B =-2(2o)0t)‘2/3 
where  6 


t = -(2o)ot)1/3  u (0 ,0 ) , 


(16) 

(17) 

(18) 


t=o 


U1(0,1)  = 1,  U2(0,1)  = 0 


(19a) 
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r 


Uj  (0,1)  = 0,  U2(0,1)  = 1 

have  been  used. 

We  can  thus  write 

u(o,t)  & u (o ,o ) Uj  (Y.l)  - (2V)1/3  u(o,o)  U2  (Y,l) , 


(19b) 


We  thus  predict  avalanching  of  the  particle  displacement  at  the 
boundary,  u(o,t),  due  to  the  avalanching  behavior  of  U1  (Y,l).  The 

avalanching  is  strong  as  it  is  occuring  even  in  the  presence  of  a 
harmonic  input. 

APPROXIMATION  II.  Here  we  will  consider  a crude  finite  difference 
version  of  Eq.  (5)  written  with  respect  to  an  almost  constant  velocity 
coordinate  system. 

From  Eq.  (5) 


3U  2 

32U  _ , 32U  . o ..  9\  t 3X  32U  , 

St7  V W + (V1  vo  > x n~7  3^ 


where  vQ2  = cq/p0  , Vj2  s Cj/p0  w = M |^  ' , and  W = MnQ2  with 

nQ  being  a critical  strain  value. 

Employing  the  so-called  7 characteristic  stretching  transformation 

C = x - Vt,  c = oVt,  (22) 

where  a is  a dimensionless  stretching  parameter  (we  shall  neglect  terms 
in  a2),  and  defining 


v (e.e)  = ||  , 


we  arrive  at 


2aV2^  + (vo2  - V2)  + (Vj2  - VQ2)  : ^rj 

where  c = aVr. 
o 

Writing  Eq.  (24)  in  crude  finite  difference  form 


= o,  (24) 
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r *<V  W'  ’<V  t™) 


F1  - F?V2  <'„■  J | ’<W  «.>  - ’<V  j 


V 2 - V 2 
0 


(26a) 


_ Vj2  - vQ2  . 
2 = 


(26b) 


Rewriting  Eq.  (25)  with  the  time  derivative  single-stepped  backwards 
gives 


Y(n,m)  - ^(n.m-l)  = Gj  - G2c|t)4'2(n,m)  f(n  +l,m)  + 


G1  - G2V2("'m>  »(n,m)  , 


where 


G1  H I Fl*  G2  E A F2 


f (n+1,0)  = -f—  f(n,o)  - (2< 

G1  J G1 

We  now  set  Y(n,-1)=0  (equivalent  to  turning  the  strain  on  at  t=Q, 
and/or  completely  neglecting  the  stretching  parameter).  With  that 
condition  Eq.  (29)  has  a solution 

1+Gi  1 " 

^(n  ,o)  = -g — H'(o.o).  (3( 


H'(0,0). 


Eq.  (30)  predicts  a geometrical  avalanching  (wave)  in  position,  in 
support  of  the  temporal  avalanching  of  Eq.  (20). 

Experimentally  it  is  known  8 that  the  martensitic  transformation 
in  Fe-29.5%  Ni  propagates  at  a velocity  approximately  one  third  vQ. 

If  we  thus  choose  V to  be  that  velocity  of  propagation  of  the  trans- 
formation, then  Ga  is  large  (a  being  small)  and  negative.  Thus  the 
spatial  avalanching,  while  present,  does  not  appear  to  be  as  strong 
as  the  avalanching  in  time. 


4.  DISCUSSION.  The  two  approximations  considered  above  hint 
strongly  that  shock  amplification  can  occur  in  the  presence  of  a 
structural  transformation.  Considerably  more  work  is  necessary, 
however,  before  the  prediction  of  an  amplification  factor  is  possible. 

In  closing  we  will  briefly  list  what  we  believe  to  be  the  promising 
approaches  for  future  work. 

(a)  Modeling.  The  inclusion  of  microscopic  effects  (e.g.  soft 
phonon  and  interatomic  potential  effects)  in  the  modeling 
of  a. 

(b)  Attenuation.  It  is  conceivable  that  known  strong  attenuation 
during  the  transformation  process  could  severly  limit  the 
predicted  amplification.  While  experimentally  1 determined 
attenuation  factors  in  Ti Ni  lead  us  to  believe  that  this  is 
not  the  case,  f ^ 0 must  be  included  at  least  for  completeness. 

(c)  Soliton  propagation.  The  current  fad  in  spatially  bounded 
non  linear  propagation  effects  involves  soliton  physics. 

It  is  necessary  to  seek  solutions  of  Eq.  (5)  from  such  a 
point  of  view. 

(d)  Finite  differencing.  It  is  necessary  to  refine  the  work 
of  approximation  II. 
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ABSTRACT » It  takes  of  the  order  of  N operations  to  solve  a set 

of  N linear  equations  in  N unknowns.  When  the  underlying  physical 

problem  has  some  time-  or  shift-invariance  properties,  the  coefficient 

matrix  is  of  Toeplitz  (or  difference  or  convolution)  type  and  the  equations 

2 

can  be  solved  with  0(N  ) operations.  We  have  shown  that  with  any  non- 
singular N x N matrix,  we  can  associate  an  integer  0L  between  1 and 


N such  that  it  takes  O(N2Q0  operations  to  invert  the  matrix.  The  number 
a may  be  small  for  many  non-Toeplitz  matrices  of  physical  interest.  Some 
aspects  of  this  result  are  discussed  here,  including  extensions  to 
continuous-time  kernels  and  integral  equations. 

1.  INTRODUCTION.  Problems  in  many  fields  lead  ultimately  to  the 
solution  of  linear  matrix  equations 

Ra  = m , 

where  R is  a given  N x N matrix,  say,  and  m is  a given  N x 1 vector. 

The  number  of  operations  required  to  solve  such  an  equation,  or  to  find 
-1  3 

R , is  of  the  order  of  N (multiplications  and  additions)  . This  can  be 
prohibitive  if  N is  large  (500  or  1000  or  3000,  as  can  arise  in  many 
power  system  or  econonometric  calculations).  For  this,  and  other  reasons, 
we  must  often  try  to  bring  in  any  special  features  or  structures  that  may 
be  present  in  the  original  physical  problem.  In  many  applications 

^This  report  is  a summary  of  a talk  given  at  the  22nd  Conference  of  Army 
Mathematicians,  Watervliet  Arsenal,  New  York,  May  1976.  It  was  based  on 
work  done  jointly  with  B.  Friedlander,  L.  Ljung  and  M.  Morf  (see  the 
references) . 


This  work  was  supported  by  the  Air  Force  Office  of  Scientific  Research, 
Air  Force  Systems  Command  under  Contract  AF44-620-74-C-0068 , and  in  part 
the  .Joint  Services  Electronics  Program  under  Contract  N00014-75-C-0601 , 
and  the  National  Science  Foundation  under  Contract  NSF-Eng  75-18952. 
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we  have  the  property 


R = [r  .]  = [r.  .] 

U i-JJ 


That  is,  the  phenomena  are  invariant  to  a change  in  the  time-  or  space- 

origin  (e.g.,  as  with  stationary  random  processes,  or  homogeneous  media, 

etc.).  In  this  case,  the  matrix  R is  said  to  be  a Toeplitz  matrix  and 

2 

ha$  the  nice  feature  that  its  inverse  can  be  found  with  only  0(N  ) multi- 
plications. Moreover  the  inverse  can  be  computed  recursively,  i.e.,  the 
N x N inverse  can  be  easily  updated  to  yield  the  (N  + 1)  x (N  + 1) 
inverse,  and  Toeplitz  matrices  also  have  other  useful  properties. 

Unfortunately,  most  operations  on  Toeplitz  matrices  destroy  the  Toeplitz 
property.  For  example,  the  inverse  of  a Toeplitz  matrix  is  not  Toeplitz, 
unless  the  matrix  is  also  lower-  or  upper-triangular.  So  also  the  product 
of  two  Toeplitz  matrices  is  not  Toeplitz,  unless  the  matrices  are  also 
both  lower-triangular  or  both  upper-triangular.  However,  some  reflection 
will  show  that  in  various  ways  one  can  regard  certain  matrices  as  being 

"less  non-Toeplitz"  than  others,  though  present  solution  methods  cannot 

2 

take  advantage  of  this — they  require  0(N  ) operations  in  the  Toeplitz 

3 

case,  and  0(N  ) otherwise. 

By  a long  process  of  abstraction  and  simplification  of  results  originally 

obtained  for  certain  nonlinear  differential  equations  [1  ],  [2  ],  we  have 

been  able  to  show  essentially  the  following  (more  precise  results  are 

stated  later) : with  any  invertible  N x N matrix  R we  can  associate 

2 

an  integer  a,  1 < a < N,  such  that  it  takes  0(N  Ct ) operations  to 
compute  its  inverse.  The  integer  a may  be  called  the  displacement  rank 
(or  index  of  nonstationarity)  of  the  matrix  and  has  the  property  that  it 
is  low  for  matrices  that  are  Toeplitz  or  near  to  Toeplitz,  while  it  is 


high  for  arbitrary  matrices.  For  example, 
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i) 

a = 

l 

for 

R = L or  U or  LU  or  UL,  where  L and  U 

denote 

lower-  and  upper-tr iangular  Toeplitz  matrices. 

ii) 

a = 

2 

for 

R = (L  + U)  and  R = (L  + U)  1 

ill) 

a 5 

4 

for 

R = (Lx  + Ux) (L2  + U2) 

iv) 

a < 

3 

for 

R = [Li  + Ux  : L2  + U2] 

v) 

a < 

n, 

if 

R is  the  covariance  matrix  of  a linear  combination 

of  the  components  of  any  n-vector  wide-sense  Markov  random 
process . 

2 3 

In  such  cases,  0(N  O)  can  often  be  signif icantly  less  than  0(N  ), 

thus  yielding  many  advantages,  not  just  for  solving  a given  large  set  of 

equations,  but  also  for  interactive  adjustment  of  the  mathematical  model 

(i.e.,  of  R and  m)  based  on  actual  examination  of  the  now-more-easily 

determined  solution  a. 

We  shall  outline  our  major  results  in  Section  2,  for  matrix  equations. 


A similar,  and  somewhat  simpler,  development  can  be  carried  out  for 
integral  equations,  as  noted  in  Section  3.  Section  4 contains  some  con- 
cluding remarks  on  possible  extensions  and  generalizations. 
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2.  THE  MATRIX  CASE.  More  details  can  be  found  in  the  paper  [ 3] 


but  we  note  the  key  definitions  and  results  here. 
Definition  1.  The  (+) -displacement  rank  of  an  S x N 
smallest  integer  Q!+(R)  such  that  we  can  write 

a+(R) 


for  some  lower-triangular  Toeplitz  matrices  {L^}  and 
Toeplitz  matrices  {U^ } . 

Definition  2.  The  (-) -displacement  rank  of  an  N x N 
smallest  integer  a (R)  such  that  we  can  write 


R 


a (R) 

~L 

1 


U 


£ 

ii 


for  some  lower-triangular  Toeplitz  matrices 
Toeplitz  matrices  {U^ ) • 

Definition  3.  Let 


and 


Z = the  lower-shift  matrix 


Lemma  1 . Computation  of  Displacement  Ranks 

a+(  R)  = p{J(R)} 

where 


J (R)  = R - ZRZ ' f 


and 


Also 


p{A}  = the  rank  of  the 
Q (R)  = e{?(R)} 


matrix  R is  the 


some  upper- triangular 


matrix  R is  the 


upper- triangular 


(1) 


matrix  A. 
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where 


F(R)  = R - Z'RZ  . (2) 

The  proof  follows  by  using  the  result  of  Lemma  2. 

Lemma  2.  Given  two  column  vectors  x, y there  is  one  and  only  one  solution 
of  the  functional  equation 

J(R)  = xy'  , (3a) 

and  this  is 

R = L(x)U(y')  , (3b) 

where  ' denotes  transpose,  L(x)  is  a lower-triangular  Toeplitz  matrix 
whose  first  column  is  x,  and  U(y')  is  an  upper-triangular  Toeplitz 
matrix  with  first  row  y' . 

Proof.  For  uniqueness^  note  that 

J(RX)  = J(R2) 

implies 

R1  - ZR12'  = R2  " ZV 

or 

R:  - R2  = Z(RX  - R2)Z’  , 
whose  only  solution  is  clearly  zero. 

The  rest  amounts  to  verifying  that  jL(x)U(y')  = xy' , which  the 
reader  may  find  amusing  to  check  by  direct  computation  for  3x3  matrices.  SI 
Lemma  1 now  follows  easily  from  the  observation  that 

a a 

R = I L(x.)U(y!)  a*  J(R)  = E x y!  (4) 

1 1 i l 

We  can  now  state  a first  simple(but  apparently  new, result. 

Theorem  1 . 

0_(R_1)  = .*  (R)  . (5) 

Therefore , 
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; >* 


Proof.  We  give  the  simple  proof  (suggested  by  S-Y . Kung)  because  it 


shows  that  the  result  is  quite  general  and  depends  very  little  on  the 
nature  of  the  entries  of  R — for  example,  they  could  themselves  be  matrices. 
We  note  that 

a_(R-1)  = c (R_1  - Z ’ R_1Z  } 

= o { (R**1  - Z'R_1Z)R) 

= c{I  - Z'R_1ZR) 


since  rank  is  unaffected  by  multiplication  by  a nonsingular  matrix.  Now 
by  a well-known  matrix  result  that 

p(I  - AB)  = p (1  - BA} 
we  can  continue  the  above  chain  as 

a_(R-1)  = p (I  - ZRZ'R-1} 

= c f (I  - ZRZ  * R_1) R } 

= c {R  - ZRZ ' } 

= a+(R)  • ■ 

Example.  If  T is  a symmetric  Toeplitz  matrix,  then  a (T)  = 2 = a (T) 
since  we  have  the  representations 


where 


T = T «I  + I-T' 

+ + 

= I .T  + T'  • I , 
+ + 


The  fact  that 


a (T)  = 2 = a (T) 

+ - 


can  also  be  seen  by  checking  that 
JCD  = T - ZTZ' 

t„  ...  t 


N 


-|  , for  all  N > 2 


o 


and 


« 


| , 

i 

t 

u 


K; 


r(T)  = 


O 


N 


N-l 


N > 2 . 


N 

Now  it  turns  out  to  have  been  well-known  in  many  contexts  (see  the 
discussion  in  [4  ])  that  there  exist  two  lower-triangular  Toeplitz 
matrices  A and  B such  that 

T_1  = A'A  - B’B  (7) 

so  that 

a_(T) 

Remark.  Notice  that  the  displacement  ranks  seem  to  identify  a better 


2 = a+  (T) 


property  of  matrices  than  their  being  Toeplitz.  The  class  of  Toeplitz 
matrices  is  not  closed  under  inversion,  unlike  the  (+) -displacement  ranks 
and  the  corresponding  representations  (6  )• 

2 

Theorem  2.  The  inverse  of  an  N x N matrix  R can  be  found  with  0(N  CO 


multiplications,  where  a is  an  integer  such  that  Q+  < C:  < + 2.  This 
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can  be  done  via  certain  recursive  formulas  called  the  generalized  Szego 


3.  INTEGRAL  EQUATIONS.  The  Fredholm  integral  equation  of  the  second 


a(t)  + / K(t, s) a (s)  ds  = m(t)  , 0 < t < T 


has  been  extensively  studied,  see,  e.g.,  the  recent  monograph  [12  ].  Except 
for  the  handful  of  cases  where  explicit  analytic  solution  is  possible,  the 
generic  technique  is  to  replace  the  integral  equation  by  some  approximating 


set  of  N linear  equations 


Ra  = m . 


This  can  be  done  in  various  ways — use  of  degenerate  kernels,  projection 

(Galerkin  and  collocation)  methods,  etc.  For  example  in  the  degenerate 

kernel  method  we  replace  K(t,s)  by  the  function 

K^t.s)  = z (^(tH^s)  (9) 

for  some  suitably  chosen  functions  [<A^  ( •)  , \|r  ( •)  }.  In  any  case,  the 

3 

resulting  set  of  linear  equations  will  in  general  require  0(N  ) operations 
for  their  solution  and  this  may  be  prohibitively  large.  More  significant 
however  is  the  observation  that  such  approximation  methods  will  generally 
destroy  any  nice  structure  that  might  have  been  present  in  the  original 
problem. 

For  example,  if  the  kernel  was  of  Toeplitz  (also  called  displacement 


or  convolution)  type, 


then  in  general 


K(t,s)  = K(t-s)  , say 


K„(t,s)  4 Toeplitz  for  N < ••  . 

N ' 


This  is  bad,  because  the  Toeplitz  property  can  be  exploited  to  find  a 
nice  solution  of  the  integral  equation.  Briefly,  first  define 
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H(t,s)  = the  Fredholm  resolvent  of  K(t,s) 


as  the  solution  of  the  integral  equation 

T 


HT(t,s)  +J  HT(t,r)K(r,s)  dr  = K(t,s)  , 0<t,s<T  . 


In  operator  notation,  we  can  write  this  as 


H + HK  = K 


(I  - H)  (I  + K)  = I 


which  shows  that  the  original  equation 


can  be  resolved  as 


(I  + K)a  = m 


a = (I  + K)  m = (I  - H)m 


a(t)  = m(t) 


-/v‘. 


s)m(s)  ds  . 


Therefore  the  basic  problem  is  to  find  H(t,s).  Now  even  though  K(t  - s) 
may  be  Toeplitz,  this  will  not  in  general  be  true  of  its  resolvent  H^tjS) 
(for  T < ••) . Nevertheless  H^,(t,s)  is  not  a completely  arbitrary  kernel, 
but  should  in  some  sense  be  close  to  a Toeplitz  kernel  (after  all,  its 
resolvent  is  Toeplitz) . 

We  can  quantify  this  intuitive  feeling  in  the  following  way  (the 
analog  of  the  method  used  in  Section  2) , Define  the  operator 


and  note  that 


JK(t,s)  = + ^)K(t’S)  ' 


jK(t  - s)  = 0 . 


If  K(t,s)  is  not  Toeplitz  jK(t,s)  d 0,  but  it  will  be  some  function 
of  two  variables f which  we  can  write  as 
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rs 


* 


a 


0.  (t)\|r.(s) 


(ID 


jK(t,s)  = E 01l  ( 

for  some  functions  {0^0  t <1^(0  } and  some  integer  a,  possibly  even 

infinite.  However  let  us  define  the  displacement  rank  of  K(t,s)  as 

the  smallest  integer  a(K)  such  that  the  representation  (11)  is  possible. 

Examples.  i)  K is  Toeplitz,  Q!(K)  = 0. 

ii)  K(t,s)  = min(t,s),  the  covariance  of  the  simplest 
nonstationary  random  process,  the  Wiener  process. 

Clearly  jK(t,s)  s 1 and  <2=1. 

iii)  K(t,s)  = ts  - min(t,s) , the  covariance  of  the 
so-called  Brownian  bridge  process.  Now 
JK (t, s)  = s + t - 1 and  a = 2.  ■ 

We  can  show  the  following  result,  analogous  to  Theorem  1 in  the  matrix 


case , 


; i 


* 

X 

\ 

- 

it 

i « 

t' 


#• 


Theorem  3.  CKH^t.s))  < a( K(t,s)  + 2. 

Example.  When  K is  Toeplitz,  a(K)  = 0.  However  even  though  its 
resolvent  HT(t,s)  is  not  Toeplitz,  there  exist  two  functions  AT(*), 


Bt(-)  such  that 


Jl^Ct.s)  = AT(t)AT(s)  - Er<t)BT(s), 

so  that 

a(HT(t,s))  = 2 . 

Moreover  the  functions  A^O  and  BT ( • ) , of  one  variable,  can  be 
determined  more  easily  than  functions  of  two  variables.  In  fact  they 


(12) 


can  be  obtained  via  the  differential  equations 


(If  ♦ lf)v*>  - - v*>  VT>  ■ 


0 < t < T 


(13a) 


5 


3T  BT(t)  = 


AT(t)BT(t)  , 


° < t < T (13b) 

with  certain  easily  determined  boundary  conditions  A (0)  and  BT(T) . 

This  is  the  analog  of  (7)  in  the  matrix  case. 


.) 
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s 


s 

X 

ii 


The  point  is  that  these  differential  equations  can  be  solved  by  a simple 
recursive  procedure,  which  needs  only  proportional  to  N2  operations, 
where  N is  the  number  of  points  in  [0,T]  used  in  any  discretization 
procedure. 

• • 

We  call  (13)  Krein-Szego-Levinson  equations  because  they  are  exactly 
the  recursions  found  by  Krein  [ 13]  for  the  continuous  analogs  of  the 

Szego  polynomials  on  the  unit  circle. 

Theorem  4.  If  K(t,s)  has  displacement  rank  a,  H^t.s)  can  be  found 

with  a times  as  much  computation  as  in  the  Toeplitz  case.  The  solution 

• • 

is  found  recursively  via  a set  of  generalized  Krein-Szego-Levinson 
equations. 

Proofs  and  further  results  can  be  found  in  the  papers  [14]-[15], 

However,  we  might  draw  explicit  attention  to  the  fact  that  though  we 

are  using  a degenerate-kernel  representation  in  (11),  this  is  for  jK(t,s) 

and  not  for  K(t,s).  Even  though  Jx(t,s)  is  degenerate  it  can  be  seen 

by  integration  that,  in  operator  notation, 

Cffl 

K = E L U 
1 11 

where  the  {L^ } and  (IL)  are  lower-  and  upper-Volterra  operators. 
Therefore  K can  be  very  far  from  a degenerate  kernel.  The  feature 
of  our  method  is  that  it  preserves  any  "Toeplitz-like"  structure  that  may 
be  present  in  K(t,s).  This  thought  is  pursued  a bit  further  in  Section  4. 


4.  CONCLUDING  REMARKS.  We  have  taken  Toeplitz  kernels  as  basic 
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because  they,  or  things  close  to  them, arise  in  many  applications  of  interest 
to  us.  However  in  other  problems,  other  "nice"  kernels  may  be  more  basic. 
For  example,  we  might  have  Hankel  kernels 

K(t,s)  = K(t  + s)  , say  . 

Integral  equations  with  such  kernels  can  be  solved  efficiently,  and  there- 
fore it  may  be  of  interest  to  classify  kernels  in  terms  of  their  degree 
of  "non-Hankelness" . This  can  clearly  be  done  as  above  by  using  the 
operator 

(It  ' ^)K(t'S>  ' 

which  gives  zero  when  applied  to  Hankel  kernels.  Similar  results  can 
also  be  obtained  for  basic  kernels  of  the  form  K (t  - s)  + K„  (t  + s) . 

Furthermore  we  could  also  define  "second"  and  higher-order  operators 
of  the  type 

j2{K(t,s)}  = (^  + K(t,s) 

and  so  on.  It  is  easy  to  find  examples  where  these  are  particularly 
appropriate. 

As  a final  comment,  we  should  express  our  feeling  that  the  basic 
ideas  described  above  should  be  adaptable  to  a variety  of  different 
situations.  Also  there  is  clearly  some  quite  general  algebraic  structure 
lurking  behind  our  results,  which  some  of  the  people  in  this  audience 
may  be  better  equipped  to  identify  than  we  can. 
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AN  EXACT  SOLUTION  TO  AN  ELASTIC-PLASTIC  DEFORMATION 
PROBLEM  IN  A RADIALLY  STRESSED  ANNULAR  PLATE 
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ABSTRACT.  An  exact  solution  to  the  small  strain  contained 
plastic  deformation  problem  in  an  annular  plate  under  internal 
pressure  is  obtained  on  the  basis  of  the  deformation  theory  of  Hencky, 
the  Mises  yield  criterion  and  a modified  Ramberg-Osgood  law. 

Expressions  for  the  stresses,  strains  and  displacement  are  given. 

Some  numerical  results  have  been  worked  out  and  assessed  by  using  the 
Budianky's  criterion  for  the  acceptability  of  the  deformation  theory. 

1 . INTRODUCTION.  The  problem  is  a partly  plastic,  annular 
plate  radially  stressed  by  uniform  pressure.  The  material  is  assumed 
to  be  elastic-plastic  and  obeying  the  Mises  yield  condition.  For 
ideally  plastic  materials,  the  stress  solution  for  this  problem  was 
first  obtained  by  Mises  [1]  and  the  corresponding  two  strain  solutions 
were  recently  obtained  by  the  present  author  on  the  basis  of  both  J? 
deformation  and  flow  theories  [2].  The  numerical  results  obtained  by 
using  these  two  theories  indicate  that  the  strain  differences  are 
very  small  and  compressibility  of  the  material  should  be  considered. 
However,  there  is  no  published  solution  for  strain-hardening  materials, 
which  is  the  purpose  of  the  present  investigation. 

In  the  present  paper,  an  exact  elastic-plastic  solution  for 
strain-hardening  materials  is  given  on  the  basis  of  J?  deformation 
theory  together  with  a modified  Ramberg-Osgood  law  [3 J - Exact 
solutions  based  on  this  particular  model  were  given  recently  to  an 
infinite  sheet  having  a circular  hole  under  uniform  external  tension 
[3]  and  internal  pressure  [4].  This  paper  considers  annular  plates 
of  arbitrary  inner  and  outer  radii.  Some  numerical  results  are 
presented  and  the  limitations  of  the  solution  are  discussed. 

2-  BASIC  EQUATIONS.  Assuming  small  strains  and  neglecting 
inertia  forces  in  the  axisymmetric  state  of  plane  stress,  the  radial 
and  tangential  stresses,  ar  and  a0,  must  satisfy  the  equilibrium 
equation, 


ae  = 0/3r)  (ror) 


(1) 
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and  the  corresponding  strains,  er  and  e0,  are  given  in  terms  of  the 
radial  displacement,  u , by 


er  = 9u/3r  , e0  = u/r  (2) 

We  shall  assume  that  the  material  is  elastic-plastic,  isotropic, 
obeying  the  simple  deformation  theory  and  the  strains  are  related  to 
the  stresses  by 

er  = E_1(or-va0)  + (Es"1-E'1)  (or-  j a0)  (3) 

e0  = E_1(a0-vor)  + (Es-1 -E-1 ) (o0-  1 ar)  , (4) 

where  E,  v are  elastic  moduli  and  E$  is  the  secant  modulus  on  the 
effective  stress-strain  curve  with  E|  = o/e  and 

o = (ar2  + oq2  - ar  o0)1/2  . (5) 

If  a modified  uniaxial  relation  of  the  Ramberg-Osgood  type  is  assumed 
[3,  4],  we  have 

Es_1  = E"1  for  o < ay;  Es  1 = E_1  (a/ay)11-1  for  a * ay  (6) 

and  the  initial  yield  surface  is  defined  by  the  ellipse  o = oy. 

Since  the  compressibility  of  the  material  is  taken  into  account, 
the  longitudinal  strain  e^can  be  determined  by 

er  + e0  + ez  = E_1(l-2v)(ar  + a0)  , (7) 

which  holds  in  the  elastic  as  well  as  plastic  region. 

The  boundary  conditions  on  the  problem  are 

°r(a>  t)  = - P , ar(b,  t)  = 0 . (8) 

Where  a,  b and  P are  the  inner,  outer  radii  and  internal  pressure, 
respectively.  In  addition;  all  stresses,  strains,  and  displacement 
■ must  be  continuous  throughout  the  entire  region. 

In  the  following,  the  solutions  will  be  presented  in  terms  of 

\ nondimensional  quantities  defined  by 

** 

ii  a = a/b,  £ = r/b,  6 = p/b,  p = P/av  , 

r*  * 

- 0p/0y,  Sq  — Oq/ Oy  , S ~ of  0y  , 

= Eer/ay  , e0  = Ee0/a^,  ez  = Eez/ay  . (9) 


where  r = p locates  the  elastic-plastic  boundary. 


3.  ELASTIC  REGION.  For  small  pressure  (p  £ p*),  the  plate 
will  be  elastic  throughout  (a  < {;  < 1 ) and  the  solution  is 


= p(ot-2-l  )-n  (1  T Cc)  , 


-2, 


= p(a'2-lj1[(l-v)  + (1  + v)£  fc] 


-2- 


-0 


ez  = -2p  (cf2-l)_1  v . 

The  critical  value  p*  to  cause  incipient  deformation  is 

p*  = (1-a2)  [3  + a4r1/2 


(10) 


(ID 


For  values  of  p larger  than  p*.  the  plate  becomes  plastic  in  the 
inner  region  {a  < £ _<  3)  and  is  still  elastic  in  the  outer  region 
(3  £ £ £ 1).  In  the  outer  elastic  region,  the  equations  for  the 
dimensionless  stresses  and  strains  are 


= (1  + S"2)/(l  + 30'4)172 


r 

e0 


-2- 


> = [(1  - v)  + (1  + v)e  ]/(l  + 33  ) 


-4,1/2 


? 

I - 

t f 

Tt 


ez  = 


2 v/(l  + 33'4)1/2  • 


(12) 


4.  PLASTIC  REGION  (a  _<  £ < g,  p**  >_  p _>  p*).  Following  Nadai 
for  isotropic  problems  [5],  we  Tntroduce  the  parametric  representation 
(0  < <f>  < tt/2) 


Sr  = -S  cos4>/sin(-rr/3) 

S = -S  cos (cp  + 26)/sin(iT/3) 

O 


(13) 


Ti 


f 
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which  satisfies  equation  (5)  identically  and  leads  to  the  following 
equation  upon  substituting  into  the  equation  of  equilibrium, 

C_1d5  = [sin(ir/3)(tan(ir/6)  + tan<|>)]_1  (tan<t>d<(>  - S"*ds)  . (14) 


By  the  extended  Mitchell  theorem  [6],  the  stress  solution  for  the 
present  problem  is  independent  of  v.  So  choose  v = 1/2  and  then 
equations  (3),  (4),  (6)  and  (9)  lead  to 

er  = (Sr  - S0/2  )Sn-]  , 

e0  = (S0  - Sr/2  )Sn‘1  . (15) 

The  compatibility  equation  follows  from  (2)  and  (9)  as 

er  = (9/9£)(£eg)  . (16) 

Substituting  (15)  into  (16)  with  the  aid  of  (13),  we  can  obtain 

£_1d£  = [-sin(ir/3)(cot(ir/6)  + cot<J>)]-1  (cotepd <p  + nS_1dS)  . (17) 

Combining  (14)  and  (17)  yields 

S'^dS  = ( tancp  + tan(u/6))/(l  - n tan<f>  tan(ir/6))  . dip  (18) 


which  can  be  integrated  with  the  known  condition  at  the  elastic- 
plastic  boundary.  Since  S and  <p  are  functions  of  £ and  8>  the 
notation  S™  = S(£,6),  <pro  = 4>U»3)  are  introduced  [2].  After 
some  manipulation , the  relation  between  and  is  given  by 


n sin<))gg  - /3  cos<J> 


.“in  r 


68 


j_n  sin<j)^g  - SS  cos<p^g [ 

where  y = (n  + 3)/(n2  + 3), 


exp 


(n-1  )/5 
n^+T- 


^88  ‘W 


(19) 


and 


tancpee  = (B2/v^  + /5)/(l-62) 
follows  from  (12)  and  (13)  at  £ = 8. 


(20) 


Substituting  (18)  into  (14)  and  carrying  out  the  integration  with 
the  known  condition  at  the  elastic-plastic  boundary,  we  have 

(8/£)2  = F(<t>C6) 
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and 


sin(<(>  -Hr/6) 

F(*5B)  “ 


sin(d>ee+7T/6) 


x exp 


n sln^g  - /?  cos^ 
n si n<t>  - /5  cos* 

t,p  >ip 


(n2-l)  3 


2X, 

n +3 


(*ee  ' *ee) 


4n 

n2  + 3 


(21) 


from  which  <jvg  can  be  solved  as  a function  of  £ and  6.  At  the  inside 
surface,  £ - a,  <p  = <p  thus  the  expression  relating  a, 6 and  p can 
be  written  parametrically  as 

p = Sag  cos<t>ag/sinU/3) 

(B/a)2  = F(<j>ag)  , (22) 

where  Sag  and  F(<pa$)  are  given  by  (19)  and  (21),  respectively.  By 
examining  (19)  and  (21),  it  can  be  found  that  Sag,  p,  3/a-«°  as 
^ag-^o  = tan"  (v'J/n)  for  finite  n.  It  should  be  noted  that  for  the 
present  problem  we  always  hive  <!>agl4>^gl4>gg  and  $n<4>otg<4w,<4>gft<(t>i i = t/2. 

Now  we  have  completed  the  stress  solution  which  is  given  by  (13), 

(19),  (20),  (21)  and  (22). 

The  solution  for  the  strains  in  the  plastic  region  (a<£<3,  p>p*) 
of  an  elastic-plastic  (finite  n)  plate  can  be  obtained  from"(3)»  (4) 
and  (7),  using  (6),  (9)  and  the  above  stress  solution.  After  some 
manipulation,  the  equations  for  the  dimensionless  strains  can  be 
written  as 

er  = "SCB  si n( 4>^3+^/3)  -S?ecos(<f>?B+TT/3)(l-v)/sin(TT/3) 

n 1 

e0  = s1n$£g  - coscp^g( 2~ v)/ s i n ( 7r/3 ) 


ez  = [Sjp  - (1  -2v)S^g]  cos(^g+ir/6)  (23) 

where  S and  <j>  R can  be  evaluated  as  functions  of  E,  and  B by 
equations  (19),  f20)  and  (21). 
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5.  DISCUSSION  OF  RESULTS.  Since  the  deformation  theory  is 
used,  the  validity  of  the  above  solution  should  be  assessed  by 
applying  Budiansky's  criterion  [7]  which  requires  the  following 
inequality  to  be  satisfied. 

[(nsn-1-l)/(sn"^-l)]1^2  > (n  tan<p  - /3)/(/3  tan<|>  +1)  (24) 

For  any  values  of  n,  the  ranges  of  S and  4>  over  which  the  inequality 
may  not  be  valid  can  be  determined.  In  the  present  case,  the  above 
inequality  is  satisfied  except  over  a certain  range  of  S and  ip  for 
n>l 7 [4]. 

Another  limitation  of  the  above  solution  is  due  to  the  small 
strain  assumption.  In  the  case  of  annular  plates  with  arbitrary 
ratio  of  inner  to  outer  radius  a , there  may  exist  two  types  of 
plastic  flow.  Full  plastic  flow  with  complete  yielding  may  happen 
for  larger  values  of  a . In  the  case  of  a flat  ring  with  smaller 
values  of  a , it  is  impossible  to  obtain  complete  yielding  in  it 
through  applying  a pressure  on  its  inner  boundary.  The  outer  portion 
of  the  ring  must  remain  strained  elastically  and  a case  of  partial 
plastic  flow  with  thickening  will  occur.  Neither  full  plastic  flow 
for  larger  a nor  partial  plastic  flow  with  thickening  for  smaller  a 
will  be  permitted  under  the  assumption  of  small  strain. 

Some  numerical  results  have  been  worked  out  for  the  2219-T87 
aluminum  plate  with  geometric  ratio  b/a  = 3.  The  material  constants 
[4]  are  n = 9,  v = 0.3,  E = 10.5xl0°psi,  ay  = 5.5xl04psi.  The  effect 
of  p/a  on  the  radial  and  tangential  stress  distributions  are  shown 
in  Figures  1 and  2,  respectively.  The  corresponding  strain 
distributions  for  the  radial,  tangential  and  axial  components  are 
shown  in  Figs.  3,  4 and  5,  respectively.  Finally  it  should  be  noted 
that  the  validity  of  the  above  results  based  on  the  deformation 
theory  have  been  assessed  by  applying  Budiansky's  criterion.  The 
range  of  S and  <p  for  the  above  stresses  and  strains  satisfy  the 
inequality  (24). 
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AN  EFFECTIVE  STIFFNESS  VISCOELASTIC  COMPOSITE  BEAM  THEORY 
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ABSTRACT.  Viscoelasticity  in  the  individual  beam  layers  is 
model ed  according  to  the  standard  linear  model  and  the  Timoshenko  beam 
theory  with  the  resulting  equations  utilized  in  deriving  a micro- 
structure or  effective  stiffness  viscoelastic  laminated  beam  theory. 

A time  harmonic  wave  propagation  along  the  length  coordinate  of  the 
viscoelastic  composite  beam  has  been  utilized  to  illustrate  an 
application  of  the  derived  theory  and  to  point  out  the  influence  of 
the  various  viscoelastic  and  geometric  parameters  involved. 

The  first  task  in  deriving  the  viscoelastic  laminated  beam 
theory  was  to  formulate  energies  for  individual  viscoelastic  layers 
in  terms  of  the  Timoshenko  beam  theory  in  a form  suitable  for 
developing  the  composite  theory.  A goal  of  the  direct  derivation  of 
the  beam  theory,  instead  of  the  intermediate  step  of  developing  a 
viscoelastic  laminated  continuum  theory  whichmust  then  be  reduced  to 
a beam  theory,  was  accomplished  by  the  introduction  of  a gross 
rotation  term  for  the  laminated  beam  into  the  derivation  of  individ- 
ual layer  energy  relations.  The  final  result  was  an  energy  conser- 
vation law  for  the  individual  beam  layers  in  terms  of  kinetic  energy, 
potential  energy,  and  dissipation  energy. 

The  viscoelastic  laminated  beam  is  composed  of  a number  of 
alternating  plane,  parallel  layers  of  two  homogeneous,  isotropic 
viscoelastic  materials  which  are  respectively  termed  the  reinforcing 
layer  and  the  matrix  layer.  To  obtain  the  total  energy  for  the 
viscoelastic  composite  beam,  the  individual  layer  kinetic,  potential, 
and  dissipation  energies  were  sunned  over  the  n layer  pairs  of  which 
the  composite  beam  was  composed.  The  discrete  system  thus  obtained 
was  then  converted  to  a continuous  system  by  means  of  a smoothing 
operation,  that  is  a replacement  of  the  resulting  energy  summations 
by  weighted  integrations  over  beam  thickness.  A reduction  of  one 
variable  from  the  formulation  was  made  possible  through  a continuity 
condition  resulting  from  continuity  of  displacement  across  layer 
interfaces.  The  final  result  of  the  derivational  work  was  a set  of 
three  flexure  equations  of  motion  and  corresponding  boundary 
conditions  for  viscoelastic  laminated  composite  standard  linear  model 
Timoshenko  beams. 
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Time  harmonic  waves  of  the  form 

(w.iM)  = (hW,4',$)e'aye1p^y/c_t^ 

were  passed  through  the  three  equations  of  motion  and  resulted  in  a 
characteristic  equation  in  terms  of  p,  the  circular  frequency;  c, 
the  phase  velocity;  a,  the  attenuation  coefficient;  and  the  numerous 
viscoelastic  and  geometric  parameters  involved. 

1.  INTRODUCTION.  A great  deal  of  work  has  been  accomplished 
in  the  area  of  elastic  laminated  effective  stiffness  or  microstructure 
continuum  theories  and  approximate  plate  and  beam  theories.  By  the 
same  token,  little  has  been  accomplished  with  viscoelastic  counter- 
parts to  these  theories. 

An  elastic  continuum  theory  which  included  effective  stiffness 
for  both  the  reinforcing  and  matrix  layers  of  a laminated  continuum 
was  developed  by  Sun,  Achenbach,  and  Herrmann  [1,  2].  The  continuum 
theory  was  utilized  by  Thomas  [3]  to  study  the  simple  thickness 
modes  for  laminated  media  with  layering  both  parallel  and  perpen- 
dicular to  the  plate  free  surfaces.  Sun  [4]  deduced  a two  dimen- 
sional theory  for  laminated  plates  from  the  three  dimensional 
continuum  theory.  Velocity  correction  coefficients  were  introduced 
into  the  two  dimensional  theory  by  Thomas  [5]  and  flexural  and  exten- 
sional  vibrations  for  plate  strips  and  rectangular  plates  were 
studied  by  Thomas  [6,  7]  according  to  this  theory  and  compared  to 
similar  results  from  effective  modulus  plate  theories.  A micro- 
structure theory  for  an  elastic,  laminated  composite  beam  was  developed 
by  Sun  [8]  and  the  approach  utilized  in  this  paper  will  be  followed 
in  deriving  a viscoelastic,  laminated  composite  beam  theory. 

Thomas  [9]  showed  that  the  flexure  beam  theory  in  reference  [8]  is 
directly  obtainable  through  a simple  reduction  of  the  existing 
flexure  equations  for  composite  plates  [4,  5]. 

A continuum  theory  for  a viscoelastic  laminated  composite  was 
developed  by  Grot  and  Achenbach  [10],  however  the  equations  developed 
were  not  applied  to  any  problems  of  wave  propagation  or  vibration. 

It  is  certainly  theoretically  possible  to  start  with  the  equations 
in  reference  [10],  to  make  appropriate  series  expansions  and  derive  a 
plate  theory,  and  to  then  follow  reference  [9]  to  make  a direct  reduc- 
tion to  a viscoelastic  beam  theory.  However,  for  convenience  and 
simplicity  of  analysis,  the  approach  in  the  current  report  will  be  to 
begin  with  the  viscoelastic  Timoshenko  beam  equations  and  work 
towards  a viscoelastic  laminated  beam  equation  in  the  manner  of 
reference  [8].  With  somewhat  guarded  conclusions.  Stern,  Bedford, 
and  Yew  [11]  have  demonstrated  a definite  need  for  an  effective 
stiffness  type  formulation  for  viscoelastic  laminates. 


The  current  approach  to  obtaining  a viscoelastic  laminated 
beam  theory  will  be  a viscoelastic  development  which  mirrors  the 
elastic  development  given  by  Sun  [8].  Surprisingly,  the  real 
difficulty  is  in  obtaining  the  energies  for  a single  layer  modeled 
as  a viscoelastic  Timoshenko  beam.  The  most  pleasing  and  straight- 
forward development  of  suitable  viscoelastic  Timoshenko  beams  results 
from  a utilization  of  viscoelastic  constitutive  relations  of  the 
differential  form;  it  is  these  equations  which  yield  a viscoelastic 
development  which  closely  mirrors  Sun's  [8]  elastic  derivation. 

2. THE  ENERGY  PRINCIP!  E.  As  Sun  [8]  does  in  the  development 

of  an  elastic  laminated  beam  theory,  the  first  task  in  deriving  a 
viscoelastic  laminated  beam  theory  is  to  formulate  energies  for 
individual  viscoelastic  layers  in  terms  of  the  Timoshenko  [12]  beam 
theory.  In  the  past,  Lee  [13]  developed  viscoelastic  Timoshenko 
beam  equations  for  viscoelastic  extensional  strain  but  the  shear 
strain  was  left  elastic.  Pan  [14]  extended  the  analysis  to  include 
viscoelastic  shear  strains.  The  current  objective  is  to  develop 
the  viscoelastic  Timoshenko  beam  equations  in  a form  more  suitable 
to  the  development  of  a viscoelastic  composite  beam  theory.  A first 
goal  will  be  the  development  of  a single  layer  energy  principle 
suitable  for  a direct  application  in  the  derivation  of  a multilayer 
energy  principle. 

The  development  of  an  approximate  theory  such  as  for  laminated 
elastic  plates  has  originally  been  a two  step  procedure.  In  the 
first  instance,  the  Mindlin  plate  theory  [15]  in  its  first  order 
approximation  was  utilized  to  develop  a continuum  theory  for  laminated 
composites.  Then  to  obtain  a laminated  plate  theory  a first  order 
approximation  is  made  on  those  variables  which  came  from  the  first 
order  part  of  the  Mindlin  theory  as  in  Sun  [4]  and  Thomas  [5]  - this 
explanation  will  become  clear  shortly.  Now  in  developing  an  elastic 
laminated  beam  theory.  Sun  [8]  has  made  both  of  these  approximations 
simultaneously  to  obtain  a flexure  theory  for  laminated  beams. 
Actually,  Thomas  [9]  has  shown  that  the  flexure  beam  theory  is 
directly  obtainable  from  the  existing  flexure  plate  theory. 

The  current  objective  is  to  immediately  derive  a viscoelastic 
laminated  beam  theory  and  to  not  have  to  develop  a viscoelastic  lami- 
nated continuum  theory  first.  In  making  the  various  zero  and  first 
order  expansions  of  displacement,  terms  which  lead  to  an  extension 
theory  are  also  maintained  since  the  second  expansion  of  extensional 
displacements  leads  to  a flexure  term.  The  first  order  displacements 
which  will  result  in  the  Timoshenko  beam  equations  [12]  for  flexure 
as  well  as  an  extensional  equation  for  beams  are 


v(y,z,t)  = v(y,t)  - z<j>(y,t) 
w(y,z,t)  = w(y,t)  - z$>(y,t). 


(1) 


the  zero  order  terms  in  (1)  are  v and  w and  a first  order  expanison 
of  these  two  displacements  results  in  the  expressions 

v(y,t)  = vk(y,t)  - zk  if>a(y,t) 

w(y,t)  = wk(y,t)  - zj  Ya(y,t)  (2) 

where  the  subscript  a = 1,  2 will  later  denote  whether  a stiff  or 
soft  laminated  beam  layer  is  indicated  and  the  superscript  k which 
layer  pair  is  indicated.  While  absolutely  necessary  at  this  point, 
the  notation  in  (2)  jumps  into  the  laminate  notation  while  seeming 
to  be  at  the  single  layer  stage  of  development.  See  Sun,  Achenbach, 
and  Herrmann  [1]  or  Sun  [8]  if  clarification  is  required. 

Combining  equations  (1)  and  (2)  and  extracting  only  those  terms 
which  result  in  flexural  motion  results  in  the  displacement  relations 

v(y,z,t)  = -z^a(y,t)  - z4>(y,t) 

w(y,z,t)  = wk(y,t)  (3) 

where  iK(y,t)  represents  the  gross  rotation  in  the  laminated  beam, 
wk(y,t)  represents  the  transverse  deflection,  and  <+) (y , t ) represents 
the  individual  layer  rotation.  The  various  displacements  and 
rotations  on  the  right  side  of  (2)  represent  the  reduction  from  a 
laminated  continuum  theory  to  a laminated  'beam  theory;  thus,  from 
continuity  of  displacement  and  rotation  at  laminate  interfaces,  it  is 
clear  that  the  notation  may  be  simplified  to  w(y,t)  = wk(y,t)  and 
tKy,t)  = ipa(y,t)  for  a = 1,  2 and  for  all  values  of  k.  "Hence  with 
these  notational  simplifications  in  mind,  the  final  form  of  the  first 
order  flexure  displacement  expansion  is 

v(y,z,t)  = -zk^(y,t)  - z<p (y , t ) 

w(y,z,t)  = w(y,t)  (4) 

where  these  equations  are  valid  only  when  eventually  utilized  in 
developing  a laminated  beam  theory.  Equations  (4)  may  be  reduced  to 
those  for  a homogeneous  or  single  layered  beam  by  setting  ^(y,t)  = 0; 
this  being  done,  equations  (4)  reduce  to  those  given  by  Brunelle  [16] 
for  flexure  of  a beam. 
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The  non-zero  strain-displacement  relations  are 


e = JL  , 3 v 
2 3y  3z 


The  non-zero  stress  equations  of  motion  which  pertain  to  the  problem 
are 


°y,y  + ayz,z  = (6} 

From  the  appendix  and  equations  (A-17)  the  constitutive  equations  for 
a special  case  of  the  standard  linear  model  are 

(1  + C _9|)oyz  = (2kG  + 2k V 3|)  cyz 

^ + C gf^y  = (E  + E ^ £y  ^ 

where  shear  correction  coefficients  k and  k have  now  been  introduced 
in  a manner  similar  to  that  of  Timoshenko  [12]  and  Kindlin  and 
Deresiewicz  [17]. 

The  procedure  involved  in  deriving  the  theory  will  be  to  manipu- 
late the  left  sides  of  equations  (6)  until  they  are  of  the  form  of  the 
left  sides  of  equations  (7).  Thus,  taking  the  first  time  derivatives 
of  (6)  and  multiplying  by  the  viscoelastic  constant  C results  in  the 
equations 

c 4yz,y  ' pC  * 

CVy  tCiyz,z‘pCV'  (8) 

which  when  added  to  their  counterparts  in  equation  (6)  become 

a + C o = pw  + pC  w" 
yz.y  yz,y 

a + C a + a + C a = pv  + pC  v (9) 

y>y  y,y  yz>z  yz>z 

Multiplying  the  first  equation  of  (8)  by  w and  the  second  equation  by 
v,  integrating  over  the  beam  volume  and  time,  and  finally  adding  the 
final  answers  results  in  the  equation 


dA  dy  dt 


/ I*  /* 

JA  Jo  Jo 


r ft 

= / L p 

A u o 


<V,y  * c W4*  (oy.y  * c Vy'4 

+ (a  + C o ) v 

yz,z  yz,z 


v v + C *v  v + w w + C w w) 


dA  dy  dt 


After  several  integrations  by  parts,  equation  (10)  may 
expressed  as 
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it  is  immediately  clear  that 
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dA  dy  dt  = 0 


since  both  beam  surfaces  are  stress  free  and  that 
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'A  Jo 


(ayz  + C 6yz)  w + (oy  + C oy)v 


dA  dt  = 


! )' 

i x, 
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be 
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(12) 
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since  the  boundary  terms  will  be  satisfied  at  the  beam  ends. 
Applying  equations  (5)  and  (7)  to  equation  (11)  and  taking  into 
account  equations  (12)  and  (13)  results  in 


/ I1  /* 

JA  ' o }o 


(2kGcyz  + 2k*G*cyz)  (2eyz) 
+(Eey  + E*cy)ey 


dA  dy  dt 


= / /£  fl  p[(w  + C w )w  + (v  + C 'v  )v]  dA  dy  dt  (14) 
A 


But,  from  the  chain  rule  of  partial  differentiation  it  is  clear  that 


g|[c2]  = ec  + ee  (15) 

or  that 

ee  = \ -A  (e2)  (16) 

2 dt 


Similarly,  the  fact  that  an  indefinite  integral  can  be  defined  as  a 
definite  integral  with  a variable  upper  limit 

/ g ( t ) d t = /tg(t)dt  + const.  (17) 

immediately  results,  after  taking  a time  derivative  of  both  sides,  in 
the  equation 


fl  g(t)dt  = g(t)  (18) 

dt  a 

p 

which  for  g(t)  = e results  in  the  relationship 

e2  = ^ f*(t)2d t (19) 

dt  o 

A direct  application  of  relations  (16)  and  (19)  to  equation  (14)  with 
an  introduction  of  equations  (4)  and  (5)  results  in  the  equation 
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after  an  integration  over  the  beam  area  where 


dy  dt 


+ t £ <L 

o o 2 dt 


A = bd 


r - M 
“ 12 


dy  dt  = 0 

(20) 

(21) 


with  b being  the  beam  width  and  d being  the  beam  thickness. 

Following  Anderson  [18],  a conservation  law  is  sought  in  the 
exi stance  of  a quantity  H such  that 


such  that  obviously 


where 


H = constant, 


™ = o 
dt 


H = T + U + V 


(22) 

(23) 

(24) 


with  the  quantities  T,  U,  and  V being  called  the  kinetic  energy,  the 
potential  energy,  and  the  dissipation  energy.  From  a comparison  of 
equations  (20),  (23),  and  (24)  it  is  clear  that  the  various  energies 
may  be  defined  as 

I « fZ  /J  T*  dy  dt 
o u 
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U = /£  Z1  U*  dy  dt 
o o 

V = /*-  /t  v*  dy  dt 
o o 

and  from  equation  (20)  it  is  clear  that  the  energies  are 

T*  = | [Aw2  + A(z£)2^2  + 42] 


★ 
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1 AkG(|^  -4>)2  + 

2 v3y 


Aiyzk\2/3^w 
2 a ay' 


+ IL(M)2 

2 >' 


(25) 
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V = 


AkV(i  4)2  + E*I(|i)2 


n 


AE*(z£)2(iji)2  -pAC  w: 


- pAC  (zJ)V-pIC  <J> 


dx 


(26) 


3.  THE  LAMINATED  BEAM  THEORY.  The  laminated  beam.  Figure  1,  is 
composed  of  a number  of  alternating  plane,  parallel  layers  of  two 
homogeneous,  isotropic  viscoelastic  materials  which  are  respectively 
termed  the  reinforcing  layer  and  the  matrix  layer.  The  reinforcing 
layer  is  the  s t if fer  of  the  two  layer  combination  and  is  indicated 
by  the  subscript  "1"  while  the  softer  matrix  layer  is  indicated  by  the 
subscript  "2".  The  elastic  constants,  the  viscoelastic  constants,  the 
layer  density,  and  the  thijkne^s  for  the  reinforcing  and  mjtrij  layers 
respectively  are  E, , , G] , C-j , p-. , d-j , and  E2,  G2,  E2,  G2,  C2» 

P2>  d2. 

The  basic  variables  involved  are  w,  the  transverse  deflection; 
ip,  the  gross  rotation  of  the  stiff  layer;  and  <t>~,  the  rotation  of  the 
soft  layer.  The  midplane  positions  for  the  kth  pair  of  neighboring 
reinforcing  matrix  layers  are  y*  and  y£  respectively  as  indicated  in 
Figure  1,  with  the  layer  midplanes  taken  perpendicular  to  the  z-axis. 
The  width  of  the  beam  is  b and  the  total  or  gross  thickness  is  h. 


From  equation  (26),  the  kinetic,  potential,  and  dissipative 
energies  in  the  individual  layers  are 
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C - ^CAaw2  * A^)2*2  ♦ I^2] 


>k  - 1 


-6  )‘ 


a 1 a a a'jy  ~V 

* ^(zj)2(|^)2  + EfiM^>2 

AakX<§  -V)2  * C'a<^)2 
C ’ lo  * -*AA.  “2 
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where  a = 1,  2 respectively  gives  the  reinforcing  and  matrix  layer 
energies. 

Now,  the  three  energies  are  summed  over  the  n layer  pairs  to 
determine  the  total  energies  for  the  composite  beam 

t*  - 1"  (r;k  * T:k) 

k=l  c 

* ^“n  / *k  *kx 

u = i (u/  + u;k) 


v*  - T <v;k  ♦ v*k)  . 


It  is  now  convenient  to  convert  the  discrete  system  (28)  to  a contin- 
uous system  by  utilization  of  a smoothing  operation,  that  is  to 
replace  the  sunmations  in  (28)  by  weighted  integrations  over  the 
thickness  variable  z. 


The  result  of  the  smoothing  operation  is  the  energies 


t* 

i « 
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h/2 

- / 
-h/2 
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(T*  + T*)  dz 

<V¥ 

h/2 

■J, 
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(U*  + U*)  dz 
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* T 

-h/2 

1 

(V*  + V*)  dz 
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(29) 

k 

Z2‘ 

(30) 

where  after  smoothing 


Carrying  out  the  integrations  in  (29)  in  terms  of  (27)  and 
taking  into  account  (30)  results  in  the  energies 
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At  this  point,  continuity  of  displacement  at  the  interface  of 
the  kth  pair  of  layers  must  be  considered.  Applying  equation  (4)  to 
a multilayer  beam  results  in  the  equation 


va(y,z,t)  = -zk  (y,t)  -z<f>a  (y,t) 


and  with  the  aid  of  Figure  2 it  is  clear  that 


V1  = -V  + "pi 


-v  - t*2 


at  the  interface  between  layers  1 and  2.  It  is  also  clear  from 
Figure  2 that 

z2  = Z1  ‘ 2^dl+d2^ 

and  that  equations  (33)  describe  the  same  interface  such  that 

v1  = v2  . (> 

From  equations  (35)  applied  to  equations  (33)  it  is  clear  that  the 
continuity  condition  is 


where 


= n<J>j  + ( 1 -rj)d>2 

dl  II  \ d2 

n ° Tvv  ’ ('"n)  " (v5?  ' 


(38) 


Following  Sun  [8],  the  variable  <t>2  is  eliminated  such  that 

_ 

*2  ' O^nT 

where  for  convenience  the  notation  <t>  = has  been  introduced. 

Expression  (38)  is  directly  substituted  into  equations  (31)  and 
the  dimensionless  variable 

e - h/(d1  + d2)  (39) 

is  introduced  to  yield  the  energy  expressions 


where 
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Now,  all  the  squares  of  the  various  sums  in  equations  (40)  are 
expanded  out  to  yield  the  final  forms  of  the  energy  expressions  as 

T = ^a4w^  + 2^a9^  + ^a13^  ^a10^ 

U*  = 2^al  (|y)2  -5*3^  + ^a12^  '^a2^y 
+ -?a8#  + ^a5(||-)2  + ^a^^)2 

^(fy)2  -2Cb3^fy  + ?b12^2  _25b2^|y  + ^2 

-2Cbg#  + Cb5(|^)2  + ?bn(f^)2  -2?b70^y 


v*-/* 

o 


dt  (42) 


J_-^b4w2  -?bgij)2  -Cb-j 3<p2  + 2£b10# 


where  the  constants  a.,-  are 


a-|  = A -j k i G i A2k2G2 


a2  = A2^2^2^^  _T1^ 

a3  = A^G^  - A^Gg/O-n) 


a4  = pi^l  + p2^2 


¥2 


a5  = 7^1  + (1_ti)E2]  + (t^F 


ag  = A2k2G2/(l-n)2  = a2/(l-n) 


. - [\_ r T 

7 " (1-nj7  2J2 
n 

an  = 


8 = (l-n)2  A2k2G2  = na6 
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a9 = Ttnpi  * (,-r,)p2]  + 77 


all  = El!l  + ( t -n) 2 E2!2 

2 

3-io  = A-i  k-.  G.  + n AokoG« 

12  ill  (i-n)?  222 

2 

a13  = pl!l  + (l-n)?  P2X2 


which  corresponds  to  the  elastic  constants  given  by  Sun  [8]  for 
elastic  laminated  beams  and  where  the  constants  b-j  are 

* * * * 
b-|  = A-|  k-|  G-|  + A2k2G2 

b2  = A2k2G2/(l-n) 

b3  = AlklGl  " A2k2G2 

b«  = Pi  Ai  C-j  + P2^2E2 

H * 

IK  * * E2I2 

b5  = ^E1  + (1‘r')E2]  + 

b6  = A2k2G2/  (1  -n)  = ^2/  ( 1 — ti  ) 

b7 = ^7 

n * * 

b8  " “ 72  A2k2G2  " nb6 
(l-n) 


bg  = pCnp-j Ci  + ( 1 -n) P2C2]  + - g P2I2C2 


>10 


(1-n) 


1 p2r2c2 


b11  " Ml  + 


( 1 -n) 


★ 

E„I. 


^2*2 


b12  = A1k]G1  + ~L- 2 A2k2G2 

(1-n) 


★ ★ 


51 3 = Pi1!1'! 


I,C,  + 


(1-n) 


2 p2!2C2 


(44) 


which  corresponds  to  the  viscoelastic  contribution  of  the  current 
analysis  of  viscoelastic  laminated  beams.  It  should  be  noted  that 
the  author  [19]  has  evaluated  viscoelastic  shear  correction  constants 
in  another  gaper  and  based  on  this  evaluation  it  is  clear  that  k-j  = 

^2  = kl  = ^2  = 77  '^2* 

Now,  from  equations  (22-25)  in  conjunction  with  equations  (42) 
it  is  easy  to  form  energy  principle  (23),  that  is  dH/dt=0,  which  upon 
various  integrations  by  parts  and  a gathering  of  common  factors  of 
w,ij>,  and  5>  results  in  equation  (23)  becoming 

2 2 * 

, 3 w + + ..  94l  + a !'i  k 3 w . u 3^|. 

- al^T  * a2*  + “3S,  * a4”  - *>1^7  + b2# 


at  ■ iy; 


♦ iy* 


* i1'  A 

o o 


Ltb3|+ V“ 


- a 


3w  a2^  32a 

23y  • a57T  + V + a7rf  ' V + V 

dy  dy 

h 3w  . 9^  + 

- b0..#  - be  o 


- a. 


10 

L+  V ■ bio^ 


32j> 


23y  “5^2  T b6^  + b7t2  " V 


9y 


9y 


3w  . d2\b  , a2* 

3¥  + %7  ‘ V - aio*  " ana7 


+ a12*  + a13#  ' b3ty  + b77T  ' V " b10^ 


3w 


32i|j 


2* 

' blV^2  + b12*  + b13* 


dtdy 


dtdy 


dtdy 
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,t  . 
+ / Iw 
0 


+ I1™ 

0 


-14 


aia7  ' V ' V + biay  ' b2*  " b3* 


dt 


Jo 


M 

a53y  " a73y  T u53y 


M . 3&  _ b M 

3v  + b53V  b7gy 


dt 


a « * a,,!*  - b,  |*U  b.#l 
73y  11 3y  7 3y  1 1 3y 


dt  = 0.  (45) 


The  viscoelastic  equations  of  motion  and  boundary  conditions  for 
laminated  beams  are  now  obtained  by  applying  the  first  lemma  of  the 
calculus  of  variations  to  equation  (45).  Thus,  the  three  equations 
of  motion  are 


A 

V 

3w 


3^w  34  34  . 3^w  . 34  . 9*  - . ... 

a1^2  - >2^  * a357  + b1^2  - b25j  - b33y  ■ a4w  * b4w 

‘23y  + a50  - a6*  • a7^f  + V + b2^  + ^ ' b6* 


324> 

- — 2 + b84>  = ag'P  ~ a10^  + b9^  “ bi 

3y 

3w  324  . , . 3^4  . . k 3w  w 324  . . 

a3W  - a7^2  + V + all3y2  ' V*  * b3*  ' b7^  + ha* 
2- 

+ bn^  " bl2*  = " am^  + aiV»  ' bio*  + bi.^ 


9y 


10 


(46) 


and  the  corresponding  boundary  conditions  are 


9w  3w  * * 

ali7  - a2*  - a3»  + bls7'  b2*  ■ b3*  ' °> 


or 


w = 0 on  y = 0,£ 


(47-a) 
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or 


asly  - a75y  * b5|  - b75f  * 


or 


ij>  = 0 


on 


y = 0,£ 


a Ml  . a M.  + b M - b,,^  = 0, 

7ay  l l ay  7dy  llay 


<p  = 0 


on 


o ,£ 


(47-b) 


( 47-c ) 


4.  WAVE  PROPAGATION.  Following  Sun  [8],  but  with  a visco- 
elastic counterpart,  a ume  flexural  wave  propagation  in  the  y- 
direction  of  the  form 


a)  = hWe-0^  eiP(y/c"t) 
* = vj/g^y  eip(y/c-t) 

4>  = <te<iy  eiPty/c-t) 


(48) 


, \ 

[ > ; 

r* 


Where  ais  the  attenuation  coefficient,  p is  the  circular  frequency, 
and  c is  the  phase  velocity.  It  is  also  convenient  at  this  time  to 
introduce  some  additional  relationships  as 

p = 2m 
c 

to  = — 

X 


X = 


2ttc 

P 


CT 


2n 

T = 

P 

K = 21 

X 

B = Xa  (49) 

Where  u is  the  frequency,  X is  the  wave  length,  x is  the  period,  K 
is  the  wave  number  and  P is  the  attenuation  constant. 


Now,  equations  (48)  are  passed  through  differential  equations 
(46)  to  obtain  the  characteristic  equations  for  wave  propagation. 
At  the  same  time,  the  following  dimensionless  parametric,  elastic, 
and  viscoelastic  dimensionless  variables  are  introduced 


l ■ * 


. 

The  final  form  of  the  characteristic  equation  for  viscoelastic  wave 
propagation  is 


(R,,  + i 

lu) 

(R-|2  + i ^12^ 

(r13  + 1 

!1 3) 

(^12  + ^ 

!1 2) 

(R^2  + i ^22^ 

(R23  + 1 

T23^ 

= 0 

(R13  + i 

T1 3^ 

(R23  + 1 !23^ 

(R33  + 1 

!33^ 

(51) 

i'V 

i; 

r» 


Where 


! ■ 
l 

k 


I ■ 

I , 

I . 


} 


R11 

= -an 

B2 

+ b„  6 - 

c V2 
C11V 

+ dn 

R12 

= r21 

= - 

b12  B + d 

12 

R1 3 

= R31 

= - 

b-|3  g + d 

h 3 

R22 

= a22 

32 

- b22  B + 

C22V  ' 

' d22 

R23 

= R32 

= - 

2 

a23  B + 

b23  B ‘ 

■ c23V 

R33 

= a33 

B2 

- b33  B + 

c33v2  ‘ 

' d33 

*11 

= A11 

32 

+ Bn  3 + 

cu\l2  ■ 

■ D11 

!12  = !21  * b12  B + D1 2 


!1 3 = *31  " B13  6 + D 


13 


*22  A22  B B22  B " C22V  + °22 


23  " *32  ~ A23  B + B23  8 + C23V  ' °23 


33 


A33  ^ ‘ B33  B " c33y2  + °33 


(52) 


The  constants  introduced  into  equation  (52)  are  defined  as 

ex  i/  y a k 

. - 11  + JL2. 

an  =z“ 


x2 


4tt  a2k*Y2 


X2 
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'11 


11 


4tt2  8ai 

= ~3T~  + 

A 

2 

4tt  k^y 


- 2 
4tt  ctg 

2 

4tt  ctgkg 


>12 


'12 


= a2k2 
(l-n)A 

O I * * 

tTTOt^  ^ 

(l-n)X 

alklY  _ n«2k2 
A (l-n)A 


b13  = ~ 


* * 


13 


22 


27TalklYl  2un  ,*  * 

^ 


>22 


'22 


22 


A 

. Ebn6l 
4iTe. 

D I 

€ A2 


eh(l-n)6? 

+ ~ -Z2--  + 
5 ^ 


e262 


(l-n)2X2 


4TT£b(l-n)62  ^c262 


d2 


(l-n)2!2 

2 2 2 
4ir  e^riQ  4tt  ( 1 -rj ) 4tt  Eg 

+ (l-n)2X2 


5 X2 

2 

4ir  £.  n6, 

D I 

Tx2- 


c X2 

4tt2£.  (1  -n)6. 


?X2 


4Tf2e  6 a k 
2 2 + 22 

(l-n)2^  (T^r 


23 


ne262 

(l-n)2X2 


B = J2 
12  2, 


D 1 2 = 2 Trb  *j  2 


r - 1 3 

B'3  17 


°13  ' 27Tb13 


B22  = 47T8  22 

87r3ebn6c'T  8Ti3eb(1-n)c'2  8^C2C2 

CX2  + Tx2  + (l-n)2x* 

_ 4TT2Eb(l-n)6* 

Uoo  - n + « 


. ? * . * * 
a2*2Y2 

+ O-n)2!2  + O-n)2 


B23  " 4tt323 
c23  = 2itC2c23 
2 . 

4tt  ne262  m^k^y* 
(l-n)*?  + (1-n)2 


6?en 


n2<SiEr 


23 


’33 


c33  = 


33 


+ 

x2 

(l-n)¥ 

47t6i  ei 

47rn262e? 

X2 

O-n)2!* 

3 

8tt  Be^c 

1 87T  n ££^2 

X2 

+ (l-n)2x2 

m 2 

4ir 

* 

47r^n262£2 

* * n a k*Y* 

+ a]k1y1  + =-ir-=- 


(l-n)‘ 


(53) 


A numerical  solution  to  characteristic  equation  (51)  is  possible 
if  it  is  recast  as  the  following  function 


f ( B » V ) = ABS  ( DET | R-j j + ili.il) 


(54) 


Using  a numerical  technique  such  as  the  Rosenbrock  [21]  optimization 
procedure  a solution  is  obtained  when 


f(M)  = 0 . 


(55) 


5. SUMMARY.  An  energy  principle  has  been  formulated  for 

viscoelastic  Timoshenko  beams  according  to  the  standard  linear 
model  with  the  stipulation,  and  hence  additional  terms,  that  the 
energy  principle  be  utilized  in  building  a viscoelastic  laminated 
beam  theory.  The  Timoshenko  model  considered  has  accounted  for 
both  viscoelastic  extensional  and  viscoelastic  shear  strains.  To 
later  incorporate  the  single  layer  energy  principle  into  the  develop- 
ment of  a laminated  beam  theory,  a term  which  accounts  for  the  beam's 
gross  rotation  was  included  in  the  single  layer  development. 

Using  the  single  layer  energies  developed,  a viscoelastic  lam- 
inated beam  theory  composed  of  a number  of  alternating,  plane, 
parallel  layers  of  two  homogeneous,  isotropic  viscoelastic  materials, 
termed  the  reinforcing  layer  and  the  matrix  layer,  was  derived.  In 
deriving  the  theory,  the  individual  layer  kinetic,  potential,  and 
dissipative  energies  were  summed  over  n layer  pairs  to  obtain  the 
total  energy  of  the  composite  beam;  these  results  are  converted  to  a 
continuous  system  by  utilization  of  a smoothing  operation  or  weighted 
integration.  The  number  of  independent  variables  in  the  total  composite 
beam  energies  is  reduced  from  four  to  three  thru  the  introduction  of 
a condition  for  continuity  at  layer  interfaces.  A direct  application 
of  the  energy  principle  developed  to  the  composite  beam  energies 
results  in  a set  of  three  equations  of  motion  and  their  corresponding 
boundary  conditions  for  viscoelastic,  laminated  composite  beams. 

Flexural  wave  propagation  has  been  considered  by  passing 
viscoelastic  harmonic  waves  through  the  derived  equations  of  motion. 
Numerical  solutions  are  possible  by  applying  the  Rosenbrock 
optimization  procedure  to  the  resulting  characteristic  equation.  A 
lack  of  computation  funds  precludes  the  presentation  of  numerical 
results  at  the  present  writing. 
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APPENDIX^  The  present  objective  is  to  derive  a set  of  consti- 
tutive relations  which  can  be  utilized  in  conjunction  with  the  basic 
equations  for  a Timoshenko  beam.  While  constitutive  equations  may 
be  formulated  in  either  integral  or  differential  form,  preliminary 
work  in  the  direction  of  formulation  of  a viscoelastic  beam  theory 
for  laminated  composite  materials  indicates  that  the  differential 
form  of  constitutive  relations  will  be  most  useful.  The  differential 
constitutive  relations  will  be  utilized  in  the  present  development. 


The  general  form  of  the  differential  constitutive  equations  is 
adapted  from  Fung  [20]  where  the  stress-strain  relations  are  of  the 
form 


where 


P,(D)o!j  - Q,(D)eij 

P2(D)okk  » Q2(D)ekk  <A-1> 

P_.(D)  and  Q ^ ( D ) are  given  by 

k=nl  k 
P (D)  = l a Dk 
1 k=o  K 

k-n«  . 

Po(D)  = l \Dk 

L k=o  k 
k=m1  v 

Qi(D)  = l ]bkDk 

1 k=o  k 
k=ml  v 

QJD)  = l d.  Dk  (A-2) 

£ k-o  K 


with  D being  the  time-derivative  operator  of  the  form 

D7f  = (A  - 3 ) 

at1 

and  where  o.^  and  eji  are  the  components  of  the  stress  and  strain 
deviators 


in  which  o. ■ and 

* J 

Now,  assume 
1 inear  model 


o'  . 
U 


e'.  . 
ij 


'1J 


3 6ij  °kk 

' 3 6i j ekk 


(A-4) 


e^j  are  the  components  of  stress  and  strain, 
equations  (A-l)  to  have  the  form  of  the  standard 


(1  ♦ A > ■ (E  ♦ C 


(A-5) 


where  o is  stress  and  e is  strain.  Comparing  the  form  of  (A-5)  with 
equations  (A-l)  it  is  clear  that  to  have  the  form  of  the  standard 
linear  model  it  must  be  true  that 

nl  = mi  = n2  = m2  = 1 (A-6) 


and  operators  (A-2)  in  light  of  (A-6)  reduce  to 

P-j  (D)  = aQ  + a.jD 

Q-j  (D)  = bQ  + b-jD 

P2(0)  = CQ  + 


Q2(D)  = dQ  + 


(A-7) 


As  will  be  subsequently  seen,  the  only  non-zero  stresses  and 
strains  for  a Timoshenko  beam  with  its  y-axis  along  the  length  and  its 
z-axis  through  the  thickness  are  ay  and  a & cy  and  e . Thus,  from 
equation  (A-4)  the  non-zero  stressyand  strain  deviators  are 


2 

3 °y 


yz 


yz 


3 y 


e = c 
yz  yz 


(A-8) 
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Now,  a direct  substitute  of  equations  (A-2),  (A- 7 ) , and  (A-8)  into 
equation  (A-l ) results  in 


[1  + (a1/ao)0]ay2  = [(bQ/a0)  + (bl/ao)D]Gyz 

[1  + (a1/a0)D]ay  = [(b0/a0)  + (b-,/a0)D]ey 

[1  + (C1/C0)D]ay  = [(d0/C0)  + (d1/C0)D]cy  . (A-9) 

There  are  thus  two  equations  for  stress-strain  in  the  y-  coordinate 

Dl°y  = °2ey 

D3°y  = D4£y  * (A-l 0) 

where 

D1  = l-(a1/a0)D 

D2  = (b0/a0)  + (b-]/a0)D 

d3  = 1 + (C1/C0)D 

°4  = (d0/Co)  + (dl/C0)D  * (A-]1) 

and  they  must  be  combined  to  form  a single  constitutive  equation 

2D1 D3o  = (D2D3  + D1D4)ey  . (A-l 2) 

Now,  from  both  the  right  and  left  sides  of  equation  (A-l 2 ) it  is 
clear  that  the  constitutive  equation  is  of  the  form 

(1  + aD  + bD2)oy  = (1  + C D + dD2)ey  , (A-l 3) 

but  it  would  now  be  desireable  to  have  the  form  of  the  standard  linear 
model  as  in  equation  (A-5),  if  possible.  This  can  be  achieved  if  the 
restriction  is  now  made  that 

0,  = D3  = 1 + (a1/ao)D  (A-l 4) 

such  that  equation  (A-l 2)  now  becomes 

[1  + (a1/aQ)D]ay  = 2t(bo/ao  + do/Co'  + (bl/ao  + dl/Co^D^ey  * (A'15^ 
As  a final  step,  define  the  constants 


(A-l 6) 


r 


C . a]/ao 

E * 2<bo/ao  * W 
E*  ‘ >,/%  * dl/Co> 


2G  ' Vao 

2G*  = b1/aQ 

with  the  final  form  of  the  constitutive  equation  thus  being 

('  * C Pf!°yz  ■ <2G  * 2G*Jf>£yz 

<'  * c £>“y  * <E  + E*^y  ■ 


(A-l 7) 
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ABSTRACT.  N.  J.  Patterson,  in  an  unpublished  note,  observed 
that  the  weight  distribution  of  a linear  code  could  be  computed 
using  a Fast  Hadamard  Transform.  In  this  paper,  we  expand  on 
Patterson's  rather  brief  exposition,  providing  a proof  that  the 
method  actually  produces  the  weight  distribution  and  making  a 
comparison  of  the  storage  and  time  involved  using  Patterson's 
method  and  the  "brute  force"  approach. 


The  weight  distribution  of  a linear  code  contains  a lot  of 
information  about  the  code,  including  its  minimum  distance  and  the 
probabilities  of  decoding  error  and  failure  if  the  decoding  algorithm 
decodes  all  patterns  of  <t  errors  and  nothing  else  (cf.  [3]).  It 
is  not  surprising,  therefore,  that  there  has  been  much  effort  expended 
in  investigation  of  weight  enumeration  of  linear  codes.  In  the  case 
of  linear  binary  codes,  a method  for  computing  weight  distributions 
involving  Fast  Hadamard  transforms  [1]  in  an  unpublished  note  by 
N.  J.  Patterson  has  certain  computational  advantages  over  the  "brute 
force"  technique  of  weight  enumeration  (in  which  a basis  for  the 
code  is  chosen  and  every  possible  linear  combination  of  the  basis 
codewords  is  taken  in  an  unimaginative  way,  with  the  weight  of  each 
codeword  recorded  as  the  codeword  is  derived) . In  this  paper  we 
expand  on  Patterson's  rather  brief  discussion,  providing  a proof 
that  the  method  actually  computes  the  weight  distribution  of  a linear 
binary  code  and  making  a comparison  of  this  technique  with  the  brute 
force  approach. 


Let  A be  a (n,k)  linear  code  over  GF(q),  with  "weight  enumerator 
polynomial" 


n 

W (x,y)  = l A.  xn  1 y1 2  , 
i=0 


where  A-^  is  the  number  of  codewords  veA  with  weight  w(v)=i.  Let  A^~ 
denote  the  dual  of  A.  MacWilliams'  Identity  states  that 

(1)  | Ax t W (x,v)  = W ,(x+(q-l)y,x-y) . 

A Ax 

If  A is  a (n,k)  linear  binary  code  then  (1)  becomes 

(2)  2n  Sj  (x,y)  = W . (x+y  ,x-y)  . 

A A1 


For  convenience,  we  will  assume  that  A is  binary.  The  method  is 
quite  general,  though,  and  has  obvious  extensions  to  cases  when 

q>2- 

Assume  that  k > or  that  A has  rate  k/n  > |.  If  k/n  <_  -, 

the  following  procedure  should  be  modified  by  interchanging  A and  A'1- 
ar.d  replacing  k by  n-k.  Let  H denote  an  (n-k)xn  parity  check  matrix 
for  A,  say 


L n~k_1J 

where  the  rows  V£,  0<ij<n-k-l,  are  vectors  in  GF(2)n  which  constitute 
a basis  for  A-K  Write 


f t t t 

^V°  "•  Vk-l 


Un-1  J 


where  u = (u  , u.,,  ...,  u.  ) is  the  binary  (n-k)-tuple 

l 10  il  i,n-k-l  n k 

comprising  the  i-th  row  of  H11.  Suppose  0<s£2n  -1,  say 

n-k-1 

s = [ s 2J.  Let 

j=°  3 n~^-1 

n-1  n-1  J SjUij 

bs  = l (-DS'Ui  = l (-1)J=° 

S i=0  i-o 

Notice  that  if  we  define  f:  V = GF(2)~  ■>  C = complex  numbers  by 


f(v)  = 


1 v=u.  for  some  i,  0<i<-n-l; 


0 otherwise. 


r»  V * S 

then  b_  = ) f (v)  (-1)  . Therefore,  b is  an  n-dimensional 

S vtV  s 

Hadamard  transform  [1]  of  f.  Now,  the  vector 


which  may  be  termed  the  "s-th  codeword"  in  A-1-.  Clearly,  as  s runs 
through  all  the  integers  from  0 to  2n~^-l,  V(s)  runs  through  all 
the  codewords  of  A-1-. 


We  have  just  shown  that  s*uj[  is  the  i-th  coordinate  of  V(s), 
and  thus  bg  = the  //  of  O-coordinates  in  V(s)  minus  the  # of  1-coordinates 

= n - 2w(V  (s)  ) . 


Hence  w(V(s))  = (n-bg)/2.  Thus,  we  can  compute  the  weight  distribution 
of  A*1  (and  subsequently,  using  MacWilliams'  Identity,  of  A)  via  the 
Fast  Algorithm: 


Step  0. 


Step  1. 
Step  2. 
Step  3. 


Select  a basis  {v0,  vlf  ....  vn_^_1 } for  A . Let  B^ 
denote  the  coefficients  of  W . , initialized  to  0, 

0<i<n.  A 

n-k 

Compute  the  "bulges"  b , 0<s<2  -1,  using  a Fast  Hadamard 

Transform. 


,n-k 


For  each  s,  0_^sf2  -1,  let  i = (n-b  )/2  and  replace  B. 

. i . t.  s i 


by  1+Bi. 

Use  the  equation  (MacWilliams  1963) 

.n-i 


n-r  . , r 

= 2k~r 


l C 


i=0 


l ( a)B.,  0<r<n  , 

i=0  n"r  1 


to  compute  the  coefficients  A^,  0<i£n. 

ri“k 

A glaring  disadvantage  of  this  method  is  that  all  of  the  2 
bulges  bs  must  be  saved.  If  not  enough  storage  is  available,  then 
the  algorithm  must  be  modified.  The  advantage  is  that  the  work  factor 
of  the  method  is  (n-k)2n“k.  By  contrast,  the  brute  force  method 
requires  the  computation  of  (s0,  s^,  ...,  s , )H  for  each  of  the 

2n  c vectors  s = (sq,  sj,  ...,  sn  ^ ^ eGF(2)n~*c.  This  could  be 
accomplished  by  the  following: 

Brute  Force  Algorithm: 

Step  0.  Select  a basis  {v0,  vj,  ...,  } for  A.1- . Let 

denote  the  coefficients  of  W^x , initialized  to  0, 

0<i<n-l,  and  let  s = 0 = (0,  0,  . ..,  0). 
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'"‘•ep  1.  Let  i = 0,  v = (0,  0,  0) 

Step  1.1.  If  Sj=l,  replace  v by  v+v^.  When  q>2,  this  requires 

n additions.  When  q=2  these  n additions  can  be  accom- 
plished by  several  mod  2 additions,  the  exact  number 
depending  on  word  size  of  the  machine  used 
to  implement  the  algorithm. 

Step  1.2.  Replace  i by  1+i. 

Step  1.3.  If  i<n-l,  go  to  step  1.1.  Otherwise,  go 
to  step  2. 

Step  2.  Compute  a = weight  of  v and  replace  B by  1+B  . 

n— "k. 

otep  3.  Replace  s by  1+s.  If  sf_2  -1,  go  to  step  1.  Otherwise 

stop . 

On  the  average,  the  vectors  s in  the  Brute  Force  Algorithm 
will  have  density  (n-k)/2.  Thus,  the  work  factor  for  this  algorithm 
is  n(n-k)2n-k-1 . That  is,  the  extra  cost  in  time  is  proportional  to 
n.  The  advantage  of  this  method  is,  of  course,  that  the  only  storage 
required  is  for  the  arrays  {A.},  {B.}  and  H.  If  A is  cyclic,  with 
parity  check  polynomial  h(x)  1(of  degree  k) , then  (regarding  a vector 
in  GF(2)n  as  a polynomial  of  degree  £n-l) , we  may  take  Vg=h(x), 
v^xhCx),  ...,  Vn-k-j  = xn~k-1h(x),  So  that  onl^  yo  nee<^  t>e  saved. 

(In  this  case,  (sQ,  Sj , ...,  sn_k_x)H  = h(x)  £ SjX1.) 


In  conclusion,  using  a Hadamard  transform  to  compute  the  weight 
distribution  of  a (n,k)  linear  code  results  in  a time  saving 
proportional  to  n at  a cost  in  storage  of  approximately  2n-k  words. 

The  technique  is  particularly  advantageous  for  high  rate  codes. 
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ABSTRACT . Bessel  functions  of  orders  zero  and  one  for  moderate  and 
large  positive  arguments  have  been  programmed  in  FORTRAN  using  factorial 
series  for  J (x) , Y (x)  and  K (x)  and  Hadamard  series  for  1 (x) . A 
subroutine  to  calculate  Stirling  numbers  of  the  first  kind  was  developed 
for  use  in  the  factorial  series.  The  recurrence  relation  was  modified 
and  the  resulting  Stirling  numbers  scaled  so  that  the  entire  range  of 
the  computer  was  utilized;  e.g.,  10  < S < lO1^  instead  of  10f*  < 

S < 10150.  In  this  way,  more  terms  of  the  series  can  be  calculated  and 
higher  accuracy  obtained.  For  use  in  the  Hadamard  series,  a sub- 
routine to  calculate  incomplete  gamma  functions  was  developed.  Various 
algorithms  were  necessary  to  encompass  the  required  range  of  arguments. 

These  programs  were  devised  to  verify  the  accuracy  (for  moderate 
and  large  arguments)  of  our  previously  developed  Bessel  function  sub- 
routine. These  programs  replace  the  asymptotic  series  with  convergent 
series,  which,  of  course,  is  desirable.  Extension  of  the  program  to 
complex  arguments  is  now  in  progress. 


1.  INTRODUCTION.  Factorial  series  derived  from  the  Laplace 
integral  converge  rapidly  for  large  values  of  the  argument,  and,  thus, 
are  preferable  to  the  corresponding  asymptotic  series.  However,  the 
traditional  algorithm  leads  to  very  large  numbers  and  must  be  modified 
if  it  is  to  be  useful  for  numerical  work.  One  procedure  for  scaling  the 
large  Stirling  numbers  which  occur  in  the  analysis  is  derived  below. 

Factorial  series  based  on  a Laplace  integral  evaluated  between 
finite  limits  will  generally  diverge,  so  that  an  alternate  procedure 
is  required.  One  method,  due  to  Hadamard,  is  to  expand  the  Laplace 
integral  in  a series  of  incomplete  gamma  functions.  The  resulting  series 
converge  rapidly  for  large  values  of  the  argument.  In  practice,  expan- 
sions in  terms  of  the  Kummer  function  are  more  convenient  for  computation. 
These  functions  are  closely  related  to  the  incomplete  gamma  function. 
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Computer  programs  based  on  these  algorithms  will  be  used  to  check 
the  accuracy  of  the  BRL  subroutines  for  Bessel  functions  of  complex 
argument  and  integral  order.  This  is  necessary  as  tables  are  not  avail- 
able for  a sufficient  range  of  order  and  argument  to  make  a detailed  check 
by  comparison. 

2.  FACTORIAL  SERIES.  The  factorial  series  are  used  to  calculate 
Kn(x),  Jn(x),  Yn(x). 

Kn(x)  can  be  expressed  in  terms  of  the  Whittaker  function  as1 


‘,w  *(h)  1/2  "o,„<2*>- 


where  the  asymptotic  expansion  for  the  Whittaker  function  is2 


W (2x)  = e' 
o,nv 


j 1 + £ [n-(-l/2)2]  [n^-  (-3/2) 2]  . 
( m=l  m! (2x)m 


[n2-  (1/2  - m)2] 


This  asymptotic  expansion  was  derived  from  a Laplace  integral  evaluated 
between  zero  and  infinity  and  involves  only  negative  integral  powers  of 
the  argument. 

For  n = 0, 


/ \l/2  l , 2 .2. .2  . 2 . _2 . _2 

„ , _h  \ -x  ) , 1 ,13  135 

o ’ 'W  ' I ' 1!(8x)  21 (8x)2'  3!(8x)3 


1/2  k A. 

e'x  E 

I j=0  X-1 


For  n = 1, 


„ /it  \1/2_-x  j , 1 ' 3 l2' 3'5  , 1 2 ‘ 32 ' 5 ' 7 

M \2x  ) e j 1 + 1 * ( 8x1  ~ 2 + 3 

1 \ZX/  / 1 ' 2!(8xr  3!  (8x)  * 


' \l/2  k B. 

h)  o-x  E -j 

/ j=o  xJ 


A computer  tabulation  of  the  first  fifty  of  these  coefficients  is 
shown  in  Table  I. 


1 Handbook  of  Mathematical  Functions , NBS55,  U.S.  Government  Printing 
Office , 1964 , p.  377. 

2 

Modem  Analysis,  E.  J.  Whittaker  and  G.  N.  Watson,  Umve. 1 tty  Press, 
Cambridge , England,  1927,  p.  343. 
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I 


All  (I)/  C I - 1 ) ! 


Bll(I)  / (1-1) ! 


1 

0 

2 

-0 

3 

0 

4 

-0 

5 

0 

6 

-0 

7 

0 

8 

-0 

9 

0 

10 

-0 

11 

0 

12 

-0 

13 

0 

14 

-0 

15 

0 

16 

-0 

17 

0 

18 

-0 

19 

0 

20 

-0 

21 

0 

22 

-0 

23 

0 

24 

-0 

25 

0 

26 

-0 

27 

0 

28 

-0 

29 

0 

30 

-0 

31 

0 

32 

-0 

33 

0 

34 

-0 

35 

0 

36 

-0 

37 

0 

38 

-0 

39 

0 

40 

-0 

41 

0 

42 

-0 

43  • 

0 

44 

-0 

45 

0 

46 

-0 

47 

0 

48 

-0 

49 

0 

50 

-0 

. lOOOOOOOOOOCOOOE  01 
. 125COCCCOOOOCOOE  00 
.35 1562500000000E-01 
•122070312500000E-01 
.467 30 041503906 3E-02 
. 189256668090820E-02 
. 795 140 86246490 5E-0 3 
. 34280 30759096 15E- 03 
.150645882968092E-03 
.67 186 20 3978053 5E-04 
.30 3 177 74 545096 7E- 04 
.138121266264335E-04 
.634254773036746E-05 
. 29 3202095 52 3644E-05 
. 13631653548261 3E-05 
.63690 1 146338206E-06 
. 298 858 39923 389 5E-06 
. 14076851071 1813E-06 
. 665283277862541 E- 07 
.3  15>3  64545496475  E-07 
.149896710531293E-07 
.714218736970249E-08 
. 341061581781506 E- 08 
.163196999789118E-08 
.782339784145318E-09 
. 37 5679564 34658 2E-09 
. 180684642 54 1690E- 09 
.870272909635814E-10 
•419734622392911E-10 
. 20 269 289360 2046E- 10 
.979963836984338E-11 
.474303516833861E-11 
.229798664344921E-1 1 
.111443911484997E-11 
.540951252872135E-  12 
.262802950502473E-12 
. 127776781778835E-12 
•62 17334460367 19E- 13 
•302739840197069E-13 
. 147513520096024 E- 13 
.7192436 5 540569 3E- 14 
. 350904046930 15 7E- 14 
•171299459984542E-14 
. 3 36694 563 53996 3E- 15 
.408893411120479E-15 
•199928685770698E-15 
.978030155285417E-16 
.478665845012651E-  16 
.234372789238237E-16 
.11480703737 7 268 E-16 


0. lOOOOOOOOOOCOOOE  01 
0.375000000000000E  00 
-0. 585937500000000E-01 
0. 170898437500000E-01 
- 0.6008 148 1933 593 8E-02 
0.231 31 3705444336E-02 
-0.9397U928367615E-03 
0.39554201 0664940E-C3 
-0.1 707320006971 71 E-03 
0.750904632 695 892 E-04 
- 0. 335091 192 340542 E-04 
0. 151275672575224E-04 
-0. 689407361 996464E- 05 
0. 3 1 6658263 165535E-05 
-0. 1 4641 405662 9473E-05 
0.680825363327048E-06 
-0.318139586281 243E-06 
0. 149299935603438E-06 
-0.7032 99465 168972 E-07 
0. 3 3241 1277685473E-07 
-0. 15 758372 132 7770E-07 
0.749058675359042E-08 
-0.35692491 11 66692 E-08 
0.1 704501 99779746E-08 
-0.81 56308 38789800E-09 
0. 39 101 34241 15830E-09 
-0.187770314798227E-09 
0.903113396791 883E-10 
-0.434997699570835E-10 
0. 209804924956 504E-10 
-0. 101318295010245E-10 
0.489854451 81 2020E-11 
-0.23 709386003 84 11E- 11 
0.1 1487295491 5 304E-11 
-0. 55 7099051 465333 E- 12 
0.27042042 732 8632E -12 
-0.131376127744436E-12 
0.6387672 3907882 0E-1 3 
-0.310812902602324E-13 
0. 15134504009851 8E-1 3 
-0.737452355542546E-14 
0. 35956834438522 3E- 14 
-0. 17542715781 5494E- 14 
0.8563814 94 446786E-15 
- 0.4 182932 5965 1984E-1 5 
0. 204421 465226220E -15 
-0.999525323533448E-16 
0. 4889597341 52 708E- 16 
- 0. 239306953222 199E  -16 
0. 1 171 741 92787109E-16 


Table  I.  Coefficients  for  Asymptotic  Series 
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These  series  can  be  summed  by  convergent  factorial  series 
algorithm  described  by  Wasow: 


using  an 


r-p+ 1 


j x(x+l) (x+2)  . . . (x+r) 


where  T denotes  the  Stirling  numbers  of  the  first  kind. 


Now,  K (x) 
o 


\i/2 

7T  \ -X  „ 

— 1 e S 
2x  / o 


where  S = 1 - -Tr5-  + T 

o 1 ! (8x)  o 

k A,  A A 

t = y'-4-  = — + — + ... 

o j 2 3 

j=2  x x 

Applying  Wasow' s algorithm  to  these  terms, 

A-,  / r 1 r 2 r 3 

_2  _ / __o _1 _2 

^2  2lx(x+lj  + x(x+l)(x+2)  + x(x+l) (x+2) (x+3)  + ' ' 

(23  4 

^o V . V 

x(x+l)  + x(x+l)(x+2)  + x(x+l) (x+2) (x+3)  + 
Therefore,  To  can  be  expressed  as 


T = y 

O “ x(x+ 


(x+1)  . . . (x+r) 


where  V = A.  fr  + A_  Fr  _ + A,  fr  _ + . . 

o,r  2 r-1  r-2  4 r-3 


These  coefficients  can  be  calculated  and  stored  in  the  memory  of  the 
computer  for  recall  on  demand. 

The  calculations  for  these  coefficients,  involving  Stirling  numbers, 
lead  to  very  large  numbers  in  the  computation  of  high-order  terms. 

Since  the  Stirling  numbers  are  always  greater  than  or  equal  to  one, 
we  modified  them  for  optimal  use  of  the  full  range  of  the  computer. 

3 Asymptotic  Expansions  for  Ordinary  Differential  Equations , W.  Wasow, 
Interscience  Publishers,  John  Wiley,  NY,  1965,  p.  330. 
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The  Stirling  numbers  were  modified  in  the  following  way: 


1 125 

S = F,  F = scale  factor,  such  as  10 

S"  = Sj'Vtn-l) 

si  - si\ + sr1/(n-^ 

The  scale  factor  and  the  number  of  modified  Stirling  numbers  which  can 
be  calculated  are  machine-dependent.  The  computers  at  BRL  have  a range 

from  IQ'155  to  10^°,  single  precision,  which  is  larger  than  the  range 

125 

of  most  computers.  As  can  be  seen  from  Table  II,  for  F = 10  and 

-135  125 

n = 150,  the  modified  Stirling  numbers  range  from  10  to  10  . The 

process  of  scaling  the  Stirling  numbers  in  this  way  must  then  be  reversed 
in  calculating  each  term  of  the  factorial  series. 


By  this  transformation, we  obtained  accurate  results  (15  significant 
digits)  for  x > 6 by  summing  150  terms.  Similar  accuracy  could  be  ob- 
tained on  most  computers  using  double  precision. 


Similarly,  K^(x) 


-x 


e 


S 


1 


where  S^  = 


1 * TTT8XT  * Ti 


CO  V 


= E 

r=l 


1 ,r 

x(x+l) 


(x+r) 


where  V,  = B.rr  , + BTrr  „ + B,rr  _ + 
l,r  2 r-1  3 r-2  4 r-3 


The  results  for  (x)  were  equally  accurate. 


are: 


4 


The  asymptotic  series  for  the  ordinary  Bessel  functions,  x _<  25, 


J 

o 


(X) 


[P  (x)  cos(x 


J)  - Qo(x)  sin (x  - J)] 


Jl(x)  = (§^)  [P  j (x)  cos(x  - ^)  - Qx  (x)  sin  (x  - ^-)  ] 


4 Bessel  Functions , Part  I,  published  by  British  Association  for  the 
Advancement  of  Science,  University  Press,  Cambridge,  England,  1937 , 

p.  202. 
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Y (x)  = — 
\ Ttx, 


[P0(x)  sin (x  - J)  + Qo(x)  cos (x  - J)] 


Y1(X)  = \h)  [Pl(x)  sin^x  - ^ + Q]  00  cos(x  - ^-)]  , 


,2.  2 2 2 2._2 

where  P (x)^  1 - - — - + 1557 

o 2 A 

2!(8x)Z  4!(8x)4 


k C. 

= V -JL 
A 2j 
j=o  x J 

and  0 f x’W Li...  4.  j2'32;52  1 2 ' 32  * S2  ‘ 72  ' 92 

° UC8X)  3!  (8x} 8 ^7 "• 

k D. 

= y _J__ 

2i  + 1 
j=o  x J 


that  Co  = |AJ,  C,  . -|A2| C.  - (-lp|A 


Do  = -IV'  D1  ‘ l»3l' 


Similarly, 


A E. 

P1  VE 

1=0  X J 


' ? -k 


1=0  X 


i,  again,  Eq  = | Bq j , Ej  = - | B2 j , 


• , E.  = 


F0  = lBll>  F1  = -lB3l>  • • • , F.  = 

For  the  ordinary  Bessel  functions,  x > 25, 

JQ(>0  = G(x)  sin(x)  + H(x)  cos(x) 

JjM  = M(x)  sin (x)  - N (x)  cos(x) 

Y0(x)  = H(x)  sin(x)  - G (x)  cos(x) 

YjCx)  = -N (x)  sin(x)  - M(x)  cos(x) 


(-Dj|B2J| 


J 


where  G (x) 
H(x) 
M(x) 
N(x) 


(ttx)’1/2  [Po(x)-Qo(x)j 
(irx)'1^  [P0(x)+Qo(x)] 
(ttx)'1^  [Pt  (x)  +Qj  (x)  ] 
l>x)'1/2  [P1(x)-Q1(x)] 


So,  for  x > 25,  the  same  coefficients  are  merely  arranged  in  a different 
manner . 


As  before,  the  results  obtained  were  accurate  to  15  significant 
digits  for  x > 6 by  summing  150  terms.  A sample  tabulation  of  the 
ordinary  Bessel  functions  from  the  computer  is  shown  in  Table  III. 

We  attempted  to  calculate  I (x)  in  the  same  manner  but  the  factorial 
series  diverged. 

5.  HADAMARD  SERIES.  The  factorial  series  for  calculating  I (x) 
and  Ij(x)  diverge  since  the  Laplace  integrals  representing  these  functions 
are  taken  between  finite  limits  and,  therefore,  cannot  be  expanded  accord- 
ing to  the  previous  algorithm.  The  Hadamard  series,  useful  for  large  x, 
was  used  instead  and  has  been  programmed. 

In(x)  can  be  expressed  by:  ^ 


(x)  . CX/2J" 

nl  -*  f (n+ 1/2) T (1/2) 


r 

/ x cos6  . 2n  n 
/ e sin  9 de 

Jo 


After  expansion  and  term-by-term  integration,  the  Hadamard  series  can 
then  be  written  in  the  form 


, . , eX  (2x)  ~ 1/2  £ d/2-n)m  y(n+m+l/2,  2x) 

n X “ r(n+l/2)T(l/2)  m=o  m!(2x)m 

where  y denotes  the  incomplete  gamma  function  and  (1/2-n)  denotes 
Pochhammer's  symbol. 


5 Theory  of  Bessel  Functions,  2nd  Ed.,  G.  N.  Watson , Macmillan  Co., 
N.Y.,  1948,  p.  204. 

b Handbook  of  Mathematical  Functions,  NBS  55,  U.S.  Government  Printing 
Office,  1964,  pp.  262,  504. 
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r 


(a)n  = a(a+l) (a+2)  . . . (a+n+1),  (a)Q  = 1 


Each  term  in  the  expansion  of  these  series  contains  the  incomplete  g^mma 
function,  which  is  expressed  below  in  terms  of  the  Kummer  function. 

y(a,x)  = a * xa  e x M(l,  1+a,  x)  , 
where  M denotes  the  Kummer  function. 

Hence,  after  substituting  and  simplifying,  we  have 

A (1/,2‘n)m  M (l«  n+m+  3/ 2 , 2x) 

Tn ^ = r (n+1/2) T Cl/2)  (n+m+1/2)  m!  * 

The  solution  of  these  series  is  straightforward  and  presented  no 
problems  in  overflowing  the  memory  of  the  computer.  The  calculation 
of  the  Kummer  function  required  many  terms  (250  terms  for  x=75)  to  get 
the  required  accuracy.  The  solutions  of  the  Hadamard  series  seem  to 
have  the  correct  convergent  behavior.  A sample  computer  tabulation  is 
shown  in  Table  IV. 


The  results  were  good  but  not  as  accurate  for  moderate  argument  as 
we  had  hoped.  We  obtained  15  significant  digits  for  x >_  17  by  summing 
25  terms  or  less  in  the  Hadamard  series.  This  is  not  much  better  than 
the  asymptotic  series  given  by  * 


x0(x) 


Ij(x) 


(2irx) 


(2irx) 


1/2 


1/2 


1 + 


1 - 


1 ! (8x) 


12,32 
2! (8x) ‘ 


1*3 

1 ! (8x) 


Ml 


2 ! (8x)  ‘ 


2 2 2 
1Z,3Z’5Z 

+ 3 + 

3! (8x) 


2 2 

r*3*'5‘7 
3!  (8x)3 


When  the  asymptotic  series  were  programmed,  we  obtained  15  significant 
digits  for  x >_  19.  However,  the  Hadamard  series  does  provide  an 
independent  check  on  the  accuracy  of  the  asymptotic  series  used  in 
our  Bessel  function  subroutine. 


4.  FUTURE  PLANS.  Extension  of  the  factorial  series  program  to 
complex  variables  is  now  in  progress.  After  thorough  checking  of  the 
accuracy  and  speed  of  computation,  a decision  will  be  made  on  which 
series  will  be  used  in  our  Bessel  function  subroutine. 

Besides  the  immediate  value  in  verifying  the  accuracy  of  the 
subroutine,  we  are  developing  insight  and  practical  numerical  procedures 
for  calculating  Laplace  integrals  for  both  finite  and  infinite  limits 
of  integration.  These  integrals  arise  in  the  solution  of  ordinary 
differential  equations. 


* 

Reference  4,  p.  271. 
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FINITE  AND  INFINITE  INHOMOGENEOUS  LADDER  NETWORKS 
C.  C.  Yang 

Mathematics  Science  Staff,  Plasma  Physics  Division 
Naval  Research  Laboratory,  Washington,  DC 


T.  N.  Lee 

The  George  Washington  University 
Washington,  DC 

INTRODUCTION 

It  is  well  known  that  by  given  sufficient  spectral  data,  the  entries 
of  a continued  fraction  expansion  relate  intimately  to  the  density  func- 
tion of  the  inverse  Sturm-Liouville  problem. 1>s  The  investigation  of  the 
pole-zero  distribution  of  a continued  fraction  with  each  of  its  entries  a 
different  complex  function  is  significant  because  of  the  simple  implementa- 
tion analytically  and  numerically.  However,  although  traced  back  to  19th 
century,  the  literature  shows  very  little  of  this  kind  of  study.  A 
recent  paper  by  Lee  and  Brown,3  sheds  some  light  on  the  pole- zero  distri- 
bution pattern  of  the  immitance  function  of  finite  inhomogeneous  ladder 
networks  by  using  the  chain  matrix  parameter  method. 

In  this  paper,  continued  fractions  with  complex  function  entries  are 
first  studied  in  a general  setting.  The  pole-zero  distribution  region 
is  described  by  a conventional  root  locus  equation  and  is  found  to  be 
bounded  in  the  corresponding  complex  plane.  The  applications  of  the 
theorems  are  illustrated  by  examples. 

PRELIMINARY  DEFINITIONS 

Let  R+  denote  the  positive  real  line,  C the  whole  complex  plane  and 
P.R.  any  positive  real  rational  function.  Two  polynomials  are  said  to 
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be  relative  prime  polynomials  or  simply  r.p.,  if  they  do  not  have  any 


C.F.  and  z(s),  y(s)  are  two  different  complex  functions  of  s. 


POLE-ZERO  DISTRIBUTION  OF  FINITE  CONTINUED  FRACTION  OF  ARBITRARY  COMPLEX 
FUNCTION  ENTRIES 

Consider  C.F.  F^  ^f^,z(s),  g^s)!  = A^/C^,  then  multiplying  both 
sides  by  y(s)  yields 


which  yields 


Hence, 


Vl(u,)  = fn+ltU[gn+lAn(t")  +y”lcn(u,)]  + An(uj) 


Cn+1  (tu^  = 8n+lAn(uj)  + y"  ' CnU)  * 


A n ( oj)  I _ = A ( oj)  I _ = 1 
n+1  'u)=0  n (u=0 


y'lcn+l(-)U  15  8n+l  + 8i  = iJ1  ®i 


This  completes  the  proof  of  the  lemma. 


The  following  corollary  is  the  direct  consequence  of  the  above 


lemma . 


Corollary  1:  If  the  same  hypothesis  of  the  foregoing  lemma  holds  for  the 


C.F.  Fn  f^s),  g ^y ( s ) , then 


a : the  zeros  of  A.,(s)  and  C..(s)  alternate  with  respect  to  rz(s)y(s),  kl, 

-i  N 

b : A ( s) | =1  and  y(s)  C (s)  | = E g. . 

2 } s I z( s)y( s)  = 0 | |s1z(s)y(s)  = o|  i=l 

The  following  facts  are  observed 


Consider  F^  [f..z(s),  g^Cs)  j = AK(s)/CN(s),  then 

Ajs)  = ( n ^V1  S f z(s)y( s)  +a  ] 
\ i=l  / i=l  L 


V» 

V 


rjr~i  *v  -^anc 


; i 

ir ' 

« 

r 


r - 


y-1cN(s)  = 


’ N- 1 \-  1 / N \ N-l  r 

n Y.  ( T.  ) n z(s)y(s)  + vL  . (h) 

\ i=l  / \ i=l  1 / i=l 


b : if  z(s)  = n (s)/d  (s)  and  y(s)  = n,(s)/d  (s),  where  n (s)  and  d (s), 

3 3 cl  D 3 fl  S 

n^(s)  and  d^(s)  are  r.p.,  then  we  have 

Case  1:  n > tn,  where  n = degree  of  (na(s)n^(s))  , in  = degree  of 

/ N \-l  nN 

(da(s)db(s))  An(s)  = ^ X j ^ | s - za_  j 


T 


1 J 


. i 

M 


b : 

4 


and 


L a ° J |s|na(s)nb(s)  = of 


[na(s)nb(s)]  _1[da(s)db(s)]  '(N_1)c(s)  I 


N 

= £ 8,  > 0 , 


}sjna(s)nb(s)  = Of  i=1  i 


c ; 


for  n > m,  n = degree  of  (n^sJ^Cs) ) , m = degree  of  (da(s)db(s) ) , 


A(s)  and  y(s)  *C(s)  are  polynomials  of  degree  nN  and  n(N-l);  for  m > n, 

A(s)  and  y(s)  C(s)  are  polynomials  of  degree  mN. 

Proof.  The  "only  if"  part:  It  follows  trivially  from  Lemma  1 and  its 

corollary. 

The  "if"  part:  Let  A(s)  and  C(s)  satisfy  condition  a through  c . It 

4 4 

follows  from  definition  B, 

A(s)  = ^ [na(s)nb(s).  + aida(s)db(s)J 

and 

/ N-l  \ - 1 / N \ N-l 

C(S)  \iOi  Yij  (i?!  8i  jnb(s)da(s)  _2i  + Va(  s)V  s)]  * 


£ 

u 


k 


where 


0 < “i  < Yi  < ai+l*  Vi  = N - 1. 


Therefore, 


A(s)/c(s)  = 


( N 

n o. 

\ i=l 

i 

1 \ N 

)i2,l 

z(s)y(s)  + a 1 
1 J 

y(s)  | 

[ N-l  \ 

^ A S 

r1! 

( N ' 

l Z g. 

' i=l  / 

\ N-l. 

1 n fz(s)y(s)  + -y. 
' i=l  l 1 

J 
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KL 


which  yields. 


A(u))/y  1 C(iu)  = A(s)/y(  s)  ^(s) 


z(s)y(s) 


N . K- 1 , 

aNUJ+VlU)  + 

N-  1 , N-2  , 

vr  +cn-2x  + 


+ u)  + 1 

• + C,  U)+  c 
1 o 


where 


/ N \-l  / K-l  \-l  / N \ / N \ 

’ b»-1=(i2iv‘)  U **)■  b°'(i.Ei81)’ 


l.,  b.  >0,  Vi 


Write , 


A(«u)/y  1 c(uj)  = fN(u  + r 1"_1  i : i 1 . 

N gjj  + [A*/y  Vj  1 

where  = a^/b^  >0,  = bN_1/a^  ^aN_i/aN  " bw-2/bN  ] > ° ’ 


A*  = A (to)  - fjjtuy  1 c(oj)  = aJ5_1  oiN  1 + • • ■ 


+ ai'  tu  + 1 , 


y~ 1 Cw  = y"1  C(u))  - g^A*  = b^_£  ajb'2  + + b*  cu  , 


a*,  b*  > 0 . 
x 1 


Moreover,  the  interlacing  zeros  of  A( oj)  and  y c(u>)  in  the  negative  real 
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5 i 


vj 


i . 


r 


-1 


axis  of  tv-plane  implies  that  the  zeros  of  A*'  and  y alternate  on  the 


same  negative  real  axis,  by  hypothesis  a through  c and  Fig.  1 shows  the 

4 4 


locations  of  the  zeros  of  A*'.  Therefore,  the  zeros  of  y * C’:'  and  A4S 


interlace  on  the  negative  u-axis  starting  with  the  first  zero  belonging 
to  A4’"  as  shown  in  Fig.  2 following  the  same  argument.  Hence,  A4f  and  y 
can  then  be  written  as 


A ("+0t) 


i=l 


and 


N-2 


c*  = kc*  5 I1”  + ^l)  * 1 " 1>  2»  “*»  N - 1,  o < a.  < Yi  < «i+1  , 


where 


k* 

a 


■a  i)-  • 


followed  from 


A4:'|  = A(uj)  - f av  ^"cCa))  | = 1 , 


aj  = 0 


U)  = 0 


It  is  easily  seen  that  A4’’  and  C*  satisfy  conditions  a through  c 

4 4 

v7  simply  substituting  back  u.'=z(s)y(s^  to  A*  and  C4%  except  for  the 
degree  of  A4!,(s)  and  y ^C4'"(s)  are  n or  m degree  less  than  that  of  corre- 
sponding degree  of  A(s)  and  y(s)  ^c(s’^  respectively. 

Therefore,  this  process  is  continued  until  N = 1,  in  this  case 


A^(u>)/y  1C1(u>)  = a^co  + 1/^  , 


where  a^  = > 0 and  c^  = g^  > 0 


Q.E.D. 


BOUNDEDNESS  OF  THE  MODULUS  OF  ZEROS  AND  POLES  OF  F^  £f.z(s),  g^C  s)  J 
In  what  follows  the  uniform  bound  is  found. 

Lemma  2:  In  the  C.F.  F^t  ^^(s),  S^'Cs)]  , if  f^,  gieR+,  Vi,  then 

3/Bcu  |y(s)FN  f tz(  s) , g-_y( s)  j j-  z(s)y(s)='j>  > °»  ^JU»  real  and  w r y±  where 
y±  are  the  poles  of  y(s)FN  ^f.z(s),  g.y(s)j  z(s)y(s)=UJ  in  the  u)-plane. 
Proof:  Straightforward  computation  shows 

»«. [yFK(»)l ■ HzF»-»(;)bt»N[f».-'(-)]+ir . v.  * , 

1 " ‘ {<■  K.w]  + i}s 


where  - y^  are  the  poles  of  yF^T(au).  The  foregoing  relation  implies  that 


f d/do)|yFN  ^(uj)  > 0,  then  B/B'ju  yF^C uu) J > 0.  But  N = 1,  2. 


•are  triv- 


ially true  and  hence  the  lemma. 


Lemma  3-  In  the  C.F.  F„  |f.z(s),  g.y(s)  , if  f , g.eR+,  Vi,  then 

N l i l J ii 
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3 


I 

1 


S/M.{y(S)Fi)[f.I(s),  V(.)J}  ,(,)y(g)  ,m<0 


and 


»/8gt{y(.)FK[f1.(.),  giy(s)jj  z(s)y(s)  , „ < 0,  V„  , 
real  and  nonpositive. 

Proof:  It  follows  from  (2), 


S/afN  JypN(u))]  < 0,  V(ueR+  , 


d/d 


% [>'FN(uj)] 


yF*(o>)  ; 


< o,  V 


U) 


Let 


1 1 


yF*(cu)  = g + — — —±— 

fj-lw+  8j-l+ + fiw+  8i 


(7) 


I 

t 


u 

r« 


S i*> 

I * 


: V 


f 


Then,  simple  computation  yields,  for  any  k < N 


3/Bfk  [yFN(uj)]  = (-  D2(N_k)  y 


U) 


[yFjj(uj)  j 2 [yF^jCu))]2---  [yFk(o))J 


3/3gk  [yFN(«,)j  = (-  i)2(N-k)+1 


yF*(  (u) 


yFk(u))l2  fyF*(u))' 


It  is  obvious  that  the  right  hand  sides  of  the  above  equations 

. , 

nonpositive  for  tu£R  This  completes  the  proof  of  the  lemma. 


are 
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Theorem  2:  Let  there  be  two  C.F.,  F^,  i^z(s),  Siy(s)  j and  F*'  f*z(s), 

g*y( s) j . If  ft  > f*eR+,  g.  > g*eR+,  i < N',  then  0^  < 0*.  Vi  < N and 
"Yj  < Vj,  Vj  < N - 1,  where  - C^,  - Of?  and  - » “ Yj  are  the  zeros  and 


the  poles  of  yF^(uo)  and  yF*(<a)  respectively. 


Proof:  Since  we  have  by  Lemma  2,  yF  ( x'  is  a monotonically  increasing 
function  of  oj  ^ by  Lemma  3,  same  function  is  an  nonincreasing 

function  of  f^  and  g^,  V<u,  real  and  nonpositive.  Therefore  all  the 
zeros  of  yF^(u>)  shift  to  the  right  on  the  real  tu-axis  as  all  the  entries 
and  increase  in  value,  as  shown  in  Fig.  3-  This  gives  0^  < Cf*  , 
Vi  < N.  The  result  of  the  poles  of  yF^(uj)  and  yF^(iu)  follows  by  using 


the  same  argument  to  the  function  of 


[>V»>]  '* 


and  it  is  omitted  here. 


Q.E.D. 


It  is  noted  that  if  the  entries  of  the  C.F.  F^  ^f^zCs),  giy(s)j 
are  uniform,  fi  = feR+,  gt  = gsR',  i < N,  then  we  have 


A^uj)  = sinh  (n  + l)  a (cu)  - sinh  Na(uu)/sinh  a(tu) 


y C^(ou)  = g sinh  hTa(x)/sinh  a(cu)  , 


where 


cosh  a(uj)  =2  + fgx/2  . 


Lemma  b:  Let  the  entries  of  the  C.F.  F^.  ^f^z(s),  g^yCs)^  =*  A^/^ 


uniform  as  defined  above,  then 


\ = 2 1 1 - cos 


b : 2 1-  cos 


1^j  /fg  < ak  < 2^1  - cos  ~*J  /fg,  Vk  < N.  (11) 


Proof:  Substituting  the  following  identities  into  (8)  and  (9), 


cosh  Na 


- n f 

k=l  L 


cosh  a - cos 


(2k- 


- 1)tt 
N 


....  _N-1  . , 1"  , krr 

sinh  Na  ° 2 sinh  a U I cosh  a - cos  

k=l  L 


“]• 


results  in  a and  b of  the  lemma  by  using  the  same  argument  as  in 
5 5 

Theorem  1 concerning  to  the  sum  of  two  polynomials  with  interlacing  zeros 
on  the  real  axis. 

As  a consequence,  the  following  theorem- is  established. 

Theorem  3:  Let  - OL.,  - v.  be  the  zeros  and  the  poles  of  the  C.F. 

i i 

FN[fiz(s)’  gi(s)]lz(s)y(s)  = a*  If  V 8ieR+’  then 


fe  < “k < \ " 


j /_fg  < °N  < ^/fg  Vk  < N - 1 , 


where 


inf  f , g = inf  g , 
i<N  ” i<N 


sup  f.  and  g 
i<N  1 


sup  g. 
i<N  1 


Proof:  The  above  result  follows  immediately  from  Theorem  2 and  Lemma  4, 

since  the  zeros  and  the  poles  of  the  C.F.  y(s)FN  £f^z(s), 
gi>'(s)]  J z(s)y(s)  - ^ shift  to  right  on  the  negative  real  cu-axis  by 
the  increasing  in  value  of  all  its  entries  f^,  g^Vi.  Therefore,  these 


zeros  and  poles  are  bounded  in  modulus  by  that  of  the  zeros  and  the  poles 

of  the  two  corresponding  C.F.  of  uniform  entries  each  with  f = inf  f , 

i<N  1 

g = inf  g.  and  f = sup  f.,  g = sup  g.,  respectively. 
i<N  i<N  1 i<N  1 


Q.E.D. 


ASYMPTOTIC  DISTRIBUTION  OF  THE  POLES  AND  THE  ZEROS  OF  THE  SEQUENCE  OF 
THE  CONTINUED  FRACTIONS 

LetH[N£iz(s>’  I be  defined  as  a sequence  of  C.F.  for 

N = 1,  2, • • • • Now  for  each  fixed  N,  the  entries  of  the  corresponding 

C.F.  are  _Tf . = >Ta./Na  and  XTg.  = _tc./N^,  i < N In  what  follows  the  re- 
NiNi  NiNi 

suit  pertaining  to  the  integrated  networks  are  derived. 

Theorem  4:  If  >Ta.  and  >Tc.,  for  all  N and  i,  of  the  above  defined 

NiNi 

sequence  are  bounded  away  from  zero,  then  and  ^y^  = 0(l/*+^)  for 

sufficient  large  N and  k,  where  - JX  and  - v.  are  the  kth  zero  and 

N k N k 

pole  of  the  corresponding  C.F.  s) » jjg^y( s)  j in  the  u)  = z(s)y(s) 

plane. 

Proof:  Since  XTa.  and  _Tc.  are  bounded  away  from  zero  for  all  N and  i, 

N r N l ’ 


we  choose 


a = sup  a.  for  all  N and  c = sup  c.  for  all  N . 
i<N  ^ 1 i<N  N X 


J 


' 


i'i 


Hence  a sequence  of  uniform  C.F.  |fn  |^fNz(s),  with  f^  = a/l/* 

and  ^ = c //  has  the  following  relationship  by  Lemma  4, 

2[i.c°.fekgV)"]  it<a<a>  Vs>  k»-i. 

where  and  ^y^  are  the  kth  zero  and  pole  of  the  uniform  C.F.  F^f^Cs), 
g^y(s)j  in  the  cw-plane. 

It  follows  from  Theorem  3>  we  have. 


2 1 


cos 


(2  k- 


1)  rr 
N 


N“+P 


ac 


< iPk  < NVk 


, Vn,  k < N - 1 . 


The  conclusion  of  the  theorem  follows. 

Q . E .D • 

REMARK 

Theorem  4 is  used  to  investigate  the  asymptotic  behavior  of  the 


EXAMPLES  AND  APPLICATIONS 


Example  1:  Let  the  entries  of  the  C.F.  be  (f^,  fg*  = (8/15* 

8/5,  48/5)  and  (g5>  gp,  g£)  = (5/8,  5/1-5,  1/15) ; z(s)  = s/(s  - 1)  and 
y(s)  = l/(s  - 1) . 

Simple  computation  yields 


[fiz( s) » giy(s)|  = A^/C^ 


15  s6  - 67  s5  + 142  sk  - 179  s5  + 142  s2  - 67  s + 15 


15(8  - 3^  s'"  + o3  - 65  s + 3^  s - 8) 


and  £z(s)y(s),  kj  = |s/(s  - l)2,  kj  which  satisfies  the  root  locus 
equation  of  1 + ^ s = 0,  ke  joURTj  and  is  shown  in  Fig.  4. 


It  follows  that 


a3  =(l/15)[s  - (1  + j {3/2)]  [s  - (1  - j^5/2)]  [s  - (5  + j ^11/6)] 

[s  - (5  - j fll/6)]  [s  - (9  + j \fl9/10)J  [s  - j Nfl9/10)J 

y_1c5  =(l/8)[s  - (3  + j\|7/4)][s  - (3  - jfT/4)][s  - (7  + jfl5/8)] 

[s  - (7  - j fl5/8)]  . 

The  zeros  of  A^  and  y are  shown  in  Fig.  5-  As  can  be  seen  that 

they  alternate  with  respect  to  j^s/(s  - 1)~,  kj  . 

In  example  given  below.  Theorem  1 is  used  to  realize  a ladder  net- 
work with  a given  immitance  function. 


* 


Example  2:  Let  the  poles  and  the  zeros  of  a driving  point  impedance 


Z(s)  be  specified  at  - 1,  - 2,  - 3 + j \j"7/2,  - 3 + j ^15/2  and  at  - 3 + 
j {7/2,  - 3 + j {u/2,  -3  + 3 ^19/2,  respectively. 

Synthesis  procedures: 

1)  Construct  the  pole-zero  plot  for  z(s),  as  shown  in  Fig.  6. 

2)  Find  an  arc  as  shown  in  Fig.  7 passing  through  all  these  sin- 
gularities. This  arc  is  described  by  £(s  + l)  (s  +2),  kj  by  inspection 
hence,  let  z(s)  = s + 1 and  y(s)  *=  s + 2. 

3)  Multiplying  out,  results  in 

k (s6  + 9 s5  + 42  + 117  s5  + 206  s2  + 213  s + 105) 

Z(s)  . 

k (s^  + 8 + 31  + 58  s2  + 84  s + 48) 


4)  Since  A ( s ) j j' 

{sl(s  + 1)  (s  +2)  = oj 


= 1,  yields  k = 1/15,  and 

& 


(s  +2)"^c(s)|  4 = 4,  yields  k = 1/2  (note  that  the 

js|(s  + l)  ( s + 2)  = o|  C 

number  4 is  arbitrarily  assumed  which  happens  to  be  the  total  capacitance 

of  the  ladder  network.),  therefore,  we  have 


(s  + 2)z(s)  I 


.cu  + 1H  (D 


(s  + l)(s  + 2)  = to  15(<d  + 2)(cd  + 4) 

5)  Hence  C.F.  gives  (f^,  fg,  = (2/15,  5/15,  24/15)  and  (g^, 
gg*  8j)  = U5/5,  15/12,  1/4).  Fig.  8 shows  the  corresponding  network. 


CONCLUSION  AND  SUMMARY 


The  complete  pole-zero  pattern  of  a continued  fraction  of  nonunifona 
entries  is  established  using  arcs  in  the  s- plane  defined  by  a simple  root 
locus  method. 

A process  of  decomposition  of  rational  functions  satisfying  the 
foregoing  pole-zero  patterns  into  continued  fractions  is  used  to  syn- 
thesize general  inhomogeneous  ladder  networks. 

The  analysis  and  synthesis  results  established  are  being  extended 
to  the  case  of  infinite  ladder  networks  (N  -*  «°)  and  the  problem  of  the 
transition  between  lumped  and  distributed  networks. 
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interlace  of  the  zeros  of  y C11,  and  A* 
Figure  2 


Location  of  the  poles  and  zeros  of  Z(s) 
Figure  6 
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Locations  of  all  the  singularities 
Figure  7 
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ABSTRACT.  A method  of  exploiting  VP  (variable  power)  splines 
for  the  purpose  of  automatic  numerical  quadrature  is  presented.  The 
essence  of  the  adaptive  method  given  here  is  to  select  mesh  points 
near  the  node  where  an  upper  bound  on  the  local  area  discrepancy  be- 
tween the  trapezoidal  estimate  and  the  local  VP  spline  estimate  of 
the  integral  Is  a maximum.  A comparison  is  made  with  Gaussian  quad- 
rature for  an  Integral  containing  a parameter. 

1.  INTRODUCTION.  The  term  "Automatic  Integrator"  refers  to 
numerical  Integration  algorithms  which  adapt  themselves  to  the  parti- 
cular situation  at  hand.  Automatic  integrators  are  particularly 
handy  for  obtaining  dependable  integral  estimates  during  computation 
on  a problem  which  may  involve  many  integrals  and  whose  nature  may 
change  from  time  to  time  as  the  parameters  involved  fluctuate.  They 
are  also  useful  in  situations  where  the  integrand  may  be  expensive 
(time  consuming)  to  evaluate  as  is  the  case  with  multidimensional  in- 
tegrals. 

The  basic  philosophy  behind  the  automatic  integration  in  this 
article  will  be  to  spend  some  computational  overhead  time  in  monitor- 
ing the  region  of  the  integrand  where  the  VP  spline  interpolater  is 
making  the  most  significant  contribution  to  the  integral  estimate 
(relative  to  the  linear  interpolater)  and  evaluate  the  integrand  in 
these  significant  regions. 

As  increasingly  more  information  is  accumulated  about  the  inte- 
grand, it  will  be  possible  for  the  algorithm  to  gradually  abandon 
evaluation  of  the  integrand  over  large  regions  of  uniform  behavior 
and  transfer  its  attention  to  regions  where  the  integrand  behaves  more 
abruptly.  This  process  will  generally  produce  a nonuniform  mesh  and 
it  will  be  necessary  to  have  on  hand  an  interpolater  which  is  smooth 
but  stable.  Variable  power  splines  satisfy  this  requirement  since 
they  are  twice  differentiable  and  they  may  be  given  some  local  deri- 
vative control  which  renders  them  less  likely  to  inject  interpolator 
oscillations. 

2.  SUMMARY  OF  BASIC  VP  SPLINE  FORMULAS.  The  interpolator 
functions  used  here  are  the  VP  (variable  power)  splines  given  on  the 
•jth  subinterval  by  Eq.  (1). 
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(1)  k1y1  (x)  = ai+biri+c1r1  +d1(l-r1) 
where  kj  = m^+n^-m^n^  , 

= (x-xj)/£.j,  and 


*1  = xi+1"x1 

The  four  parameters  aj,  bf,  cf  and  dj  may  be  eliminated  in  favor 
of  *i»  yi+1.  y|  and  y]+1  . 

(2)  ai  * kiy1+£1(miqi-(mi-l)yj-yj+1) 

(3)  b1  = A1(-m1niqi+miyj+n1yj+1) 

(4)  cj  = Ai  (niqt-yi-(n-|-1  Jyj+i ) 

(5)  di  = *i(-miqi+(m1-'l)yi+yJ+1) 
where  qi  = (yi+1 -yi )/^i 

If  second  derivative  continuity  is  enforced  at  the  interior  nodes 
and  the  curvature  is  set  equal  to  zero  at  the  end  points,  the  follow- 
ing tridiagonal  system  of  equations  may  be  obtained. 

(6)  (mrl)yj+y2  = m1q1 

(7)  AiyJ_1+BiyJ+CiyJ+1  = Dj  ( 1 <i <N) 

(8)  yN-l+(nN-r1)yN  = "N-l^N-l 

The  coefficients  in  Eq.  (7)  are  given  by  equations  (9-12). 

(9)  Aj  = mi.1(m1_1-l)/(ki_1£j_1) 


(10)  C1  * n1(n1-l)/(ki£1) 


(11)  Bj  = (n^_]-l  )Aj+(uij-l  )Cj 

(12)  = n^_-|A^q^_i+m^C^q^ 


The  solution  to  the  system  described  by  equations  (6-12)  yields 
the  nodal  derivatives  which  Insure  continuity  of  the  second  derivative 
of  the  Interpol ater. 


All  that  Is  needed  now  to  completely  define  the  Interpol ater  Is 
the  setting  of  the  nonlinear  parameter  vectors  m and  n.  The  values  of 
mf-i  and  nl  are  set  by  obtaining  a VP  spline  over  the  restricted  node 
set  [x-f _i | xi,  xi+]].  Setting  the  end  curvatures  equal  to  zero  and 
setting  y<  equal  to  the  slope  of  the  line  through  (x^.y^)  which  makes 
equal  angles  with  the  linear  Interpol ater  on  the  left  and  right  of  x^ 
yields  Eq.  (13). 


(13)  n^/m^  = (tf/t^.]) 


|7qf_i+l)/(q?+l)J 


1/2 


Equation  (13)  sets  the  m's  and  n's  while  assuming  a lower  bound 
of  L on  t.em  l.e.,  either  nj  * L or  m^_i  * L.  This  lower  bound  L must 
be  greater  than  2 and  It  need  not  be  greater  than  3.  Values  of  L 
greater  than  3 tend  to  produce  too  much  flattening  of  the  Interpolater 
between  nodes. 


3.  INTEGRATION  FORMULA.  If  the  VP  spline  Is  Integrated  over  the 
jth  subinterval,  we  may  obtain  Eq.  (14)  after  some  rearrangement  and 
simplification. 
x1+l 

(14)  / yi(x)dx  * (t|/2)(y|«yl4>1)  + A< 

X1 


where 


The  quantity  A*  Is  the  discrepancy  between  the  trapezoidal  esti- 
mate and  the  VP  spline  estimate  of  the  Integral  over  the  ith  subinter 
val.  This  expression  for  ai  Is  not  dependent  on  the  existence  of 
second  derivatives. 

If  qj  Is  between  qj_]  and  qj+-|  and  y^  Is  between  q^  and  q^ 

(k  = 1,  1+1)  and  mj  * n-j  * m,  the  maximum  value  that  |A^|  may  take  on 
Is  1.^ |q^_-| _q-|+-|  | / (6+4/2)  for  an  m of  1 + /2. 

4.  SIGNIFICANT  NODES.  An  Initial  mesh  over  the  desired  Inter- 
val of  Integration  must  be  assumed.  This  mesh  may  be  uniform,  or 
prior  analytic  knowledge  of  the  Integrand  may  prompt  the  Insertion  of 
a node  or  two  near  an  abruptness  In  the  Integrand.  In  any  case,  the 
Initial  mesh  may  be  uniform  or  non-uniform  and  may  contain  as  few  as 
three  points. 

The  relative  significance  of  the  various  points  In  the  sample 
must  be  determined  first.  This  will  be  done  by  considering  the  be- 
havior of  a VP  spline  with  zero  end  curvatures  over  the  restricted 
node  set  [x-| _-| , xj,  Xj+j].  Enforcement  of  second  derivative  continu- 
ity at  node  1 yields  Eq.  (15). 


(15)  R1  = n1/m1_1  = (i1A1.1)(q1_1-y])/(yj-q1) 


This  equation  Implies  Eq.  (13)  with  yj  selected  as  previously 
mentioned.  If  Eq.  (14)  Is  used  with  the  conditions  for  zero  end  cur 
vatures,  two  simple  Integral  formulas  may  be  obtained  for  the  three 
point  VP  spline. 


06) 

r 

y^ifxjdx 

•1 

* {*i_1/2)(y1-l+y1)  + ul 

(17) 

/ 1+1yi(x)dx  = 
X1 

Ui/2)(yt+yi+1)  + v1 

where 

U1 = 

Uf_]/2)(qj_]-yj  )/(m.|_i+l) 

and 

vi  = (t^/2)(yj-q1)/(ni+l) 
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The  two  area  discrepancy  terms  Uj  and  vj  will  be  used  to  deter- 
mine the  significant  points  of  the  sample. 

At  this  point,  we  want  to  notice  the  effect  of  yj  on  ui  and  v-j 
as  It  varies  between  the  left  and  right  difference  quotients  qi_i  and 
q^  which  are  taken  to  be  the  reasonable  limits  for  the  assignment  of 
y{  locally. 

From  Eq.  (15)  we  see  that  as  yj  approaches  qi>  Ri  approaches 
Infinity.  The  value  L Is  therefore  assigned  to  mi_i  as  ni  becomes  p 
Infinite.  The  quantity  uj  therefore  approaches  Its  extreme  value  u-i 
as  v-j  approaches  zero. 

(18)  u*  = (tf_i/2)(qi_i-q^)/(L+l) 


Similarly,  as  yi  approaches  q-j _-|  • Ri  approaches  zero.  Hence,  n-j  Is 
assigned  the  value  L as  m^.i  becomes  Infinite.  We  therefore  have  Vj 
approaching  Its  extreme  value  v*  while  ui  approaches  zero. 

(19)  v*  « (tf/2)(qi.1-q1)/(L+l) 


These  extreme  values  of  Uj  and  v*  gives  us  the  significance 
weights  that  we  will  assign  to  the  nodes  in  the  sample. 

(20)  wi  « I ^r^i-il 


5.  INTEGRATION  ALGORITHM.  The  weight  given  by  Eq.  (20)  Is  pro- 
portlonal  to  the  sum  of  |u^|  and  |vf|;  It  Is  an  easily  calculated 

measure  of  the  possible  disagreement  which  may  exist  between  the  VP 
spline  estimate  of  the  Integral  locally  and  the  linear  estimate.  It 
behooves  us,  therefore,  to  examine  the  Integrand  more  closely  near  the 
node  where  wi  is  presently  the  largest.  An  algorithm  for  automatic 
Integration  may  therefore  be  summarized  by  the  following  procedural 
outl Ine. 

I.  Generate  an  Initial  (not  necessarily  uniform)  mesh  over  the 
interval  of  integration,  evaluate  the  Integrand  and  compute 
the  trapezoidal  estimate  of  the  integral. 

II.  Compute  the  jth  nodal  significance  weight  according  to  Eq. 
(20)  for  1 <1 <N . 

III.  Find  the  node  where  w^  Is  the  largest. 
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IV.  If  the  maximum  weight  Is  less  than  a given  fraction  of  the 
trapezoidal  estimate  or  If  the  number  of  functional  evalua- 
tions exceeds  a given  amount,  skip  to  VII.,  otherwise  con- 
tinue to  V. 

V.  Evaluate  the  Integrand  at  the  midpoint  of  the  ^ th  (1-1^) 
subinterval  If  Is  larger  (smaller)  than  *i_i. 

! ‘ 

| ■-  VI.  Update  the  x and  y arrays  and  the  trapezoidal  estimate  and 

recalculate  the  three  appropriate  nodal  weights.  Return  to 
step  III. 

VII.  Set  the  m's  and  n's  according  to  Eq.  (13). 

VIII.  Compute  the  nodal  derivatives  using  equations  (6-12). 

I 

ft 

IX.  Compute  the  VP  spline  Integral  estimate  using  Eq.  (14). 

6.  A TEST  CASE.  The  following  Integral  containing  a parameter 
Is  considered  here  as  a test  case;  It  Is  obtained  from  a Wei bull 
probability  density. 

• 2 l -I  b 

(21)  / k(b)xb  'e  x dx  = 1 

0 

ob 

where  k(b)  = b/(l-e-^  ) 

As  b becomes  large,  the  integrand  will  become  a tall  spike  cen- 
tered near  1.  The  performance  of  VP  spline  adaptive  Integration  will 
be  compared  with  that  of  32  point  Gauss -Legendre  quadrature.  It  Is 
obvious  that  any  quadrature  formula  using  a constant  mesh  may  be  de- 
feated by  this  Integral  If  b Is  chosen  large  enough.  The  purpose  of 
the  comparison  Is  therefore  not  to  belabor  this  fact  but  to  Indicate 
that  the  adaptive  method  Is  capable  of  handling  even  this  pathological 
case  accurately  and  stably. 
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The  following  error  table  was  computed  for  an  L of  2.5.  Only  32 
functional  evaluations  were  made  for  the  VP  spline  Integral  estimates. 


b VP GAUSS 


2 

.00012 

4 

- .000019 

6 

.00016 

8 

.00028 

10 

.00054 

12 

.00048 

14 

.00014 

16 

.00010 

18 

.00041 

20 

.00066 

22 

.00013 

24 

.000018 

26 

.00048 

28 

.00034 

30 

.00037 

40 

.00071 

50 

.00019 

70 

.00031 

90 

.00043 

no 

.0010 

130 

.00074 

150 

.00062 

-.00000000000000036 

-.00000000000000014 

.0000000000060 

.00000030 

-.000043 

.00011 

.0032 

.0069 

.0034 

-.012 

-.04 

-.079 

-.13 

-.18 

-.23 

-.47 

-.60 

-.78 

-.90 

-.95 

-.98 

-.99 


For  well  behaved  Integrands,  Gaussian  quadrature  seems  to  be  un- 
beatable - as  evidenced  by  the  early  entries  In  the  table.  The 
Gaussian  accuracy  deteriorates,  however,  as  its  mesh  becomes  less 
capable  of  detecting  the  spike.  By  the  time  b has  reached  a value  of 
150,  Gaussian  quadrature  has  "lost"  99*  of  the  Integral  value.  Adap- 
tive VP  spline  integration,  although  not  as  accurate  as  Gaussian  for 
small  values  of  b,  displays  a uniform  error  pattern  which  Is  independ- 
ent of  b over  a considerable  range. 

Needless  to  say,  a much  better  parametric  study  than  has  been 
done  here  could  be  done  for  a variety  of  integrands.  Fortran  list- 
ings of  relevant  subroutines  are  given  here  as  an  appendix  for  those 
Interested  In  using  adaptive  integration  in  a practical  setting  or  for 
those  who  might  be  able  to  do  a more  complete  parametric  study. 
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DERIVATIVES  WHICH  INSURE  SLCONf)  DERIVATIVE  CONI 
OF  1HE  VARIABLE  POWER  IVPI  SPLINE 
NPNTS, X.Y-PATA 
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ABSTRACT.  The  usual  method  of  computing  a rotation  matrix  as  a function  of 
the  Euler  angles  is  discussed.  On  a digital  computer  these  angles  must  be 
obtained  by  a numerical  integration  of  the  angle  derivatives,  which  are  func- 
tions of  the  angular  velocity  components  of  the  rotating  coordinate  system. 

The  numerical  integration  in  effect  imposes  a rotational  motion  with  constant 
angular  velocity  over  a time  interval  of  length  equal  to  the  integration 
step-size.  This  constancy  of  the  angular  velocity  is  exploited  to  formulate 
a simple  secondorder  differential  equation  for  the  orthogonal  matrix  describing 
the  rotation.  The  equation  is  easily  solved  exactly,  and  gives  an  expression 
for  the  matrix  at  the  end  of  an  integration  interval  as  a function  of  the  matrix, 
and  of  the  angular  velocity  components,  at  the  beginning  of  the  interval.  The 
second  method  avoids  some  of  the  difficulties  of  the  Euler  angle  method,  and  can 
be  usefully  applied  in  digital  simulations  of  rigid-body  motion. 

1.  INTRODUCTION.  A mathematical  model  of  the  motion  of  a rigid  body  requires 
information  regarding  the  relation  between  two  cartesian  coordinate  systems, 
one  of  which  is  rotating  with  respect  to  the  other.  This  information  is  contained 
in  the  nine  elements  of  the  matrix  R describing  the  change  of  basis  from  one 
coordinate  system  to  the  other.  The  physical  requirement  that  the  magnitude  of 
a vector  be  invariant  under  a change  of  basis  due  to  a rotation,  imposes  the 
following  mathematical  condition  [1]  on  R: 

RRT  = I = RTR  , (1.1) 

where  I is  the  identity  matrix,  and  the  T-superscript  denotes  the  matrix  transpose. 
This  condition  is  referred  to  as  the  orthogonality  condition,  and  R is  said  to  be 
an  orthogonal  matrix. 
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Equation  (1.1)  represents  nine  linear  equations  in  the  nine  elements  of  R, 

T T 

which  would  uniquely  determine  those  elements  but  for  the  fact  that  RR  = I = R R 
is  a symmetric  matrix.  Due  to  this  symmetry,  only  six  of  the  equations  are 
linearly  independent.  The  three  undetermined  elements  serve  to  parameterize  the 
(infinite  number  of)  different  rotation  matrices,  and  the  set  of  all  such  matrices 
constitutes  the  three  parameter  group  of  orthogonal  matrices. 


A popular  choice  for  the  parameters  is  a set  of  three  angular  coordinates 
0j  , 02  , and  03,  known  as  the  Euler  angles  [2].  With  this  choice  the  matrix  R 
can  be  written  as  a product  of  three  separate  rotations,  through  each  of  the  three 
Euler  angles.  At  least  two  potential  difficulties  accompany  this  parameterization. 
The  first  is  a matter  of  economy  of  computation.  Once  the  values  of  the  Euler 
angles  have  been  determined,  one  still  must  compute  the  matrix  elements  of  R as 
sums  and  products  of  trigonometric  functions  of  the  angles.  Such  computations 
can  become  very  time-consuming,  and  therefore  expensive,  on  a digital  computer. 

The  second  problem  is  of  a mathematical  nature.  It  can  be  shown  that  for  a given 
sequence  of  Euler  rotations,  the  angular  velocity  components  co.,  i = 1,2,3,  in 
the  rotating  basis,  can  be  expressed  as  linear  functions  of  the  Euler  angle 
derivatives  0^,  i = 1,2,3.  That  is,  at  any  time  t,  one  has  relations  of  the 
following  form: 


w.(t)  = EG  (0  (t), 0,(t))  6 (t)  , 

1 z J J 


1,2,3, 


(1.2) 


where  all  summations  are  understood  to  be  from  1 to  3,  on  repeated  indices  of  the 
summand.  (The  coefficient  matrix  G depends,  in  general,  only  upon  the  last  two 
rotation  angles  of  the  rotation  sequence.)  To  determine  the  angles,  one  must 
first  solve  (1.2)  for  the  derivatives  of  the  angles,  and  then  integrate  these 
derivatives.  The  solution  of  (1.2)  for  the  derivatives  involves  inverting  the 
matrix  G.  However,  for  certain  values  of  the  Euler  angles,  the  determinant  of 
G vanishes,  hence  G ^ does  not  exist,  and  the  Euler  angle  method  fails  for  those 
values  of  the  angles. 


The  following  observations  are  important  for  developing  an  alternate 
method  of  computing  a rotation  matrix.  In  a digital  model  all  integrations 
are  performed  numerically.  Typically,  a numerical  method  requires  for  the 
computation  of  the  value  of  a variable  the  previously  calculated  value  of  the 
variable  and  its  derivative.  For  illustrative  purposes,  consider  a numerical 
integration  based  on  a first-order  Taylor's  series.  Assuming  the  values 
0^(0)  and  6^(0),  i = 1,2,3,  to  have  been  computed  at  the  beginning  of  an 
integration  interval  (which  we  take  for  convenience  to  be  t = 0) , this  method 
comput  s the  following  values  for  the  Euler  angles  at  the  end  of  an  integration 
interval  of  step-size  x : 


0i(x)  = ei CO)  + x0.(O)  , i = 1,2,3. 


(1.3) 


For  values  of  the  Euler  angles  for  which  the  coefficient  matrix  G in  (1.2)  is 
non-singular,  we  find  from  (1.2): 

0.(0)  = EgT j (02(O),e3(O))  u (0)  , i = 1,2,3.  (1.4) 

Substituting  (1.4)  into  (1.3)  then  yields  for  the  new  values  of  the  angles 

0,  (t)  = 0 (0)  + xl  G~]  (0  (O),0  (0))  co.(0)  , i = 1,2,3  . (1.5) 

In  (1.5)  the  angular  velocity  dependence  of  the  new  values  involves  only  the 
previous  values  uk(0).  Since  the  elements  of  R(x)  can  be  constructed  as  functions 
of  the  0^(t),  the  values  w.(0)  are  the  best  values  of  the  angular  velocitv  com- 
ponents available  for  computing  R(x) . Hence,  for  digital  computation  purposes 
the  angular  velocity  components  can  be  considered  to  have  the  constant  values 
to  j (0)  on  time  intervals  equal  in  length  to  the  integration  step-size,  that  is, 
for  all  te [0  ,x ] . 

This  constancy  of  the  angular  velocity  on  integration  intervals  allows  us  to 
formulate  and  solve  a simple  second-order  differential  equation  for  R.  The  solution 


allows  a direct  computation  of  R(t)  as  a function  of  the  initial  matrix  R(0) , 
and  the  angular  velocity  components  (0) , j = 1,2,3.  For  the  case  R(0)  ■ I 
(that  is,  when  the  two  coordinate  systems  initially  coincide),  the  result  is  the 
well-known  expression  [3,4]  for  the  matrix  describing  rotations  about  an  arbitrary 
fixed  axis.  Although  the  method  we  describe  is  thus  fairly  well-known  (it  was  in 
fact  developed  for,  and  is  being  successfully  applied  in,  a large  digital  missile 
simulation  [5]),  the  derivation  given  in  Section  3 is  believed  to  be  new  and,  in 
our  opinion,  much  more  straight-forward  than  the  geometrical  arguments  given  in 
the  usual  derivations  [3,4]. 

2.  SOME  PROPERTIES  OF  ANTISYMMETRIC  MATRICES.  By  definition,  an  antisymmetric 

"" x 

matrix  A is  a square  matrix  satisfying  the  identity  A = -A.  From  this  identity 
one  can  easily  deduce  the  following  general  form  for  a 3><3  antisymmetric  matrix: 


a2  _al 


Introducing  the  Levi-Civita  permutation  symbol  ^ (£x23  = *■»  eijk  = ^ ^or 

even  (odd)  permutations  of  1,2,3,  and  e - 0 if  any  two  indices  are  the  same), 

1 J h 

the  matrix  elements  of  A can  be  written  concisely  as 

A_  = Se±jk  ^ , i,j  = 1,2,3.  (2.2) 

2 

By  taking  the  product  of  A with  itself,  we  obtain  the  matrix  elements  of  A in  the 


2 2 

A.  . = -a  6 . . + a. a.  , 
13  13  i 3 


where  6..  is  the  Kronecker  delta  symbol  (6..  = 1 if  i = 3,  and  6..-0ifi/j), 
13  ij  iJ 


2 2 2 2 
and  where  a = a^  + + a^ 


Defining  a symmetric  matrix  S(a)  by 


si.(a)  = a.a.  , 


(2.4) 


one  can  write  A as 


2 2 

A = -a  I + S (a)  . 


It  is  easy  to  show,  using  (2.2)  and  (2.4),  that  AS(a)  = 0,  hence,  multiplying 
both  sides  of  (2.5)  by  A gives  the  very  useful  identity: 


3 2 

A = -a  A . 


(2.6) 


3.  THE  DIFFERENTIAL  EQUATION  FOR  R.  Assuming  the  elements  of  R to  be  differen- 
tiable functions  of  time  on  the  interval  [0,t],  we  differentiate  both  sides  of 
(1.1)  to  obtain 


R(t)  RT(t)  + R(t)  RT(t)  = 0, 


where  R is  the  matrix  containing  the  derivatives  of  the  elements  of  R,  and  we 
note  that  1=0.  Defining  a new  matrix  ft  'by 


ft(t)  = R(t)  R (t)  , 


(3.2) 


we  obtain  from  (3.2)  and  (3.1),  and  the  identity  (AB)^  - B^A 
ft(t)  = R(t)  RT(t)  = -R(t)RT(t)  = ~[R(t)RT(t) ]T  = -nT(t)  , 


and  it  follows  that  the  matrix  ft  is  antisymmetric.  By  (2.1)  ft  can  be  written 
in  the  general  form: 


W 


n<t)  = 


o 

-w  (t) 

(^(t) 


0 

-w1(c) 


-w2(t) 

U).  (t) 
0 


(3.3) 


It  is  demonstrated  in  several  textbooks  [6,7]  that  the  elements  of  fl  , defined 
by  (3.2),  can  be  identified  with  the  components  in  the  rotating  basis  of  the 
angular  velocity  vector.  As  discussed  in  the  Introduction,  the  best  available 
values  of  these  components  on  the  interval  [0,t]  are  the  previously  computed 
values  uk  (0) . 

Setting 


!l  = fl(0)  , toj  = u)  (0),  j = 1,2,3,  (3.4) 

and  multiplying  both  sides  of  (3.2)  by  R(t) , using  the  orthogonality  condition 
(1.1),  we  obtain  the  following  first-o- der  differential  equation  for  R: 


R(t)  = flR(t)  . (3.5) 

Since  fl  is  a constant  matrix  on  [0,t],  (3.5)  can  be  differentiated  to  yield: 

R(t)  = fiR(t)  = fl2R(t)  , (3.6) 

where  R(t)  has  been  replaced  by  (3.5)  in  the  last  equation  of  (3.6).  Multiplying 
(3.6)  by  fl  now  gives 

flR(t)  - fl3R(t)  = flR(t)  + w2flR(t)  = 0 (3.7) 


3 

where  we  have  used  (2.6)  for  fl  , and  where 

u2  = ui  + u2  + w3  ' (3.8) 
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Since  u)2  is  a scalar,  it  commutes  with  Q , and  (3.7)  can  be  written  as 


JJ[  R(t)  + w2R(t)  ] = ncQ  = 0 , (3.9) 

where  we  have  defined 

R(t)  + w2  R(t)  = Cq  . (3.10) 

Equation  (3.10)  is  the  familiar  equation  for  a forced  harmonic  oscillator, 
except  that  the  "dependent  variable"  is  here  a matrix  function  R,  and  the  "forcing 

function"  is  an  as  yet  undetermined  matrix  C . It  is  easy  to  show,  using  (3.5)  , 

o 

(3.6),  and  (2.6),  that  CQ  = 0,  so  that  Cq  is  in  fact  a constant  matrix. 
Furthermore,  using  (2.5),  one  can  show  that  Cq  = 0 implies  that  n = 0,  which, 
from  (3.5)  corresponds  to  the  trivial  solution  R(t)  = R(0) , te[0,T].  By  direct 
substitution  one  can  then  verify  that  the  non-trivial  solutions  of  (3.10)  have 
the  general  form: 

R(t)  = C /w2  + C sinwt  + C.cosuit  , (3.11) 

o 1 

where  and  are  arbitrary  constant  matrices.  To  determine  the  constant 
matrices  in  (3.11),  we  evaluate  R and  its  first  two  derivatives  (found  by 
differentiating  (3.11))  at  t = 0,  and  compare  the  results  with  (3.5)  and  (3.6) 
evaluated  at  t = 0.  The  results  are 

C /w2  = R(0)  + n2R(0)/u2  , 

0 

c1  = aR(o)/co  , 

C2  = -fi2R(0)/u>2  . 
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Substituting  these  expressions  into  (3.11),  we  obtain  the  following  solution 
for  the  rotation  matrix  at  time  t = t : 


R(t)  = [ I + (Q/w)sinioT  + (n2/w2)  (1-coswt)]  R(0)  . (3.12) 

It  is  convenient  to  define  a "transition"  matrix  X by 

X(x)  = I + (fl/a>)sinwT  + (fi2/w2)  (1-cosojt)  . (3.13) 

If  the  matrix  R(0)  is  known,  then  X(t)  defines  the  transition  over  the  interval 
of  length  t,  to  the  new  matrix 

R(t)  = X(t)  R(0)  . (3.14) 


If  the  two  coordinate  systems  initially  coincide,  so  that  R(0)  = I,  then  R(t)  = X(t). 
Using  (2.2)  and  (2.3)  in  (3.13),  we  obtain  the  matrix  elements  of  X in  the  form 


VT> 


S^coswt  + Eeij^(io^/(i))sinu)T  + (oj^Wj/u2)  (1-coswt) 


9 


which  is  a slightly  simplified  form  of  equation  (19)  of  Ref.  4 for  the  elements 
of  the  matrix  describing  a rotation  through  the  angle  cut,  about  an  axis  defined 
by  the  direction  cosines  , i = 1,2,3. 


4.  CONCLUSION.  The  transition  matrix  method  described  in  this  paper  eliminates 
the  inversion  singularity  problem  of  the  Euler  angle  method,  as  well  as  the  numer- 
ical integration  of  the  Euler  angle  derivatives  required  by  that  method.  Also, 
the  only  trigonometric  functions  to  be  computed  in  (3.12)  are  sinwr  and  coswt  , 
hence  computation  time  should  be  reduced  by  the  transition  matrix  method.  If  so 
desired,  the  Euler  angles  can  be  recovered  at  any  time  from  the  rotation  matrix, 
for  they  are  simply  inverse  trigonometric  functions  of  the  matrix  elements.  We 
remark  that  (3.12)  is  approximately  valid  on  any  interval  for  which  the  angular 
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velocity  is  approximately  constant,  that  is,  on  any  interval  where  the  angular 
acceleration  is  "small".  It  would  appear  that  this  formalism  has  significant 
advantages  over  the  usual  Euler  angle  method. 
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ABSTRACT . Fundamental  fields  and  weight  functions  are  presented  in  closed 


form  by  algorithm  and  formula. 


INTRODUCTION . STATES  OF  PLANE  STRAIN.  During  the  last  three  decades 


the  analysis  of  stress  fields  near  the  edges  of  cracks  has  grown  into  a disci- 
pline of  its  own.  Various  methods  for  the  computation  of  stress  intensity 
factors  have  been  developed.  The  use  of  weight  functions  is  one  of  them.  Origi- 
nally proposed  for  states  of  plane  strain  [ 1] , the  method  can  be  extended  to 
three-dimensional  fields  [2,  3,  4],  In  the  sequel  we  shall  do  this  for  the  con- 
figurations of  the  penny-shaped  and  of  the  elliptic  crack.  The  analysis  is  with- 
in the  frame  of  the  classical  theory  of  elasticity.  Using  a rectangular  carte- 
sian coordinate  system  x,y,z  we  denote  the  respective  displacements  by  u,v,w  and 
the  stresses  by  ax,  t Xy  etc.  in  the  familiar  manner.  It  is  useful  to  begin  with 
a review  of  states  of  plane  strain  within  a cylindrical  elastic  body  V with 
generators  parallel  to  the  z-axis.  Figure  1 shows  its  cross-section  in  the 
(x>y)-plane.  V has  mirror  symmetry  with  respect  to  the  (x,z)-plane.  In  the 
same  plane  a crack  with  faces  C+,  c~  extends  from  the  z-axis  in  the  direction  of 
the  negative  x-axis.  The  boundary  of  V consists  of  the  crack  faces  and  of  a 
cylindrical  surface  B.  Let  B be  attacked  by  a load  of  tractions,  the  latter 
acting  with  components  X,Y  in  x-  and  y-direction  respectively  and  with  X,Y  the 
same  along  a generator.  Assuming  mirror  symmetry  of  the  distribution  of  tractions 
with  respect  to  the  (x,z) -plane  and  imposing  the  constraint  w = 0 we  obtain  a 
state  of  deformation  in  V where  u,v  do  not  depend  on  z (plane  strain)  and  where  a 
suitable  disposition  of  rigid  body  motion  makes  u an  even  and  v an  odd  function 
of  y (mode  I).  Let  x = rcos 9 , y = rsin Q define  polar  coordinates  r ,0.  With 
their  aid  the  asymptotic  behavior  near  r = 0 of  the  relevant  field  quantities  can 
be  described  as  follows: 


plane  strain 
mode  I 


Figure  1 


a = 


YTr 


f(0)cosifl  with  a suitable  constant  k and  where  \ 


f(0)  = l _ sin-iosin-^0 


furthermore 


u = y-|r  (k  - cos  0)cos-|  0 


for  a = a 


1 ^ 

f ( 0)  = 1 + sin^Ssin-^©  for  a = 

f(0)  = sin40cos-|0  for  a = x 


xy  ’ 


= “ cos  0)sin!  6 

k - 3 - 4v,  v = Poisson's  ratio 
u = shear  modulus  . 


(1.1) 


(1.2) 


The  constant  k is  known  as  stress  intensity  factor.  The  asymptotic  relations 

(1.1) ,  (1.2)  stay  valid  if  a bounded  and  smooth  distribution  of  tractions  on  C , 
C"  is  admitted  in  accord  with  the  symmetry  of  mode  I.  It  is  customary  to  con- 
sider the  term  r "V2  in  (l.l)  as  a point  singularity  in  the  (x,y)-plane  at  the 
"crack  tip"  r = 0.  Nevertheless  the  singularity  is  along  the  whole  z-axis  as  a 
singular  line  (the  edge  of  the  crack).  This  should  be  kept  in  mind. 

Although  the  stresses  are  unbounded  near  r = 0 the  energy  of  deformation 
per  unit  length  in  z-direction  is  bounded  in  general.  More  precisely  it  is 
bounded  within  any  cylinder  r = rp  of  sufficiently  small  radius  rQ.  If  un- 
bounded the  cause  is  not  asymptotic  behavior  in  accord  with  (l.l)  but  singular 
behavior  of  the  stress  field  at  points  r / 0 of  load  application.  The  latter 
happens  for  concentrated  loads.  If  B is  smooth  and  if  the  tractions  are 
bounded  and  smoothly  distributed  then  the  energy  per  unit  length  is  bounded.  In 
practical  mechanics  no  other  situations  are  encountered.  The  singular  behavior 

(1.1)  of  the  stresses  notwithstanding,  we  are  justified  to  denote  the  field 
responding  to  the  applied  tractions  as  a regular  field. 

Let  now  a field  of  plane  strain  and  of  mode  I have  the  property  that 

u,v  = 0(r  a = 0(r  near  r = 0 (1.3) 

We  shall  call  such  a field  fundamental  if  it  goes  without  body  forces  and  if  it 
displays  no  surface  tractions.  It  is  not  difficult  to  construct  such  a field. 
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Let  t ^ 0 be  an  arbitrary  constant.  We  set  up 

u = u+u,  v=v+v  where 

s r’  s r 

ug  = tr”1//2(i  cos-|  6 + (k  --|)cos-|  ©) 

vg  = tr"1//2(|sin|e-  («  + |)sin-|  e) 

and  where  ur,v  are  the  displacements  u,v  of  a suitably  chosen  regular  field. 

It  so  happens  that  the  displacements  us,vs  create  a stress  field  without  body 
forces;  no  tractions  are  induced  on  C+,  C“  while  a system  of  self-equilibrated 
tractions  shows  up  on  B.  We  choose  ur,vr  so  as  to  compensate  the  tractions  on 
B.  This  establishes  u,v  by  (1.4)  as  the  displacements  of  a fundamental  field. 

The  asymptotic  laws  (1.3)  can  be  rewritten  in  the  vein  of  (1.2),  (1.1).  The 
details  follow  from  the  explicit  form  of  us,vs  in  (1.4).  It  has  been  shown  in 
[l]  that  the  construction  (1.4)  yields  all  fundamental  fields  in  V of  mode  I. 

The  energy  of  deformation  per  unit  length  is  infinite.  More  precisely  the 
energy  is  already  infinite  within  any  cylinder  r = r0,  no  matter  how  small 
rQ  > 0.  We  can  dispose  of  t by  normalizing  the  fundamental  field.  If  t(x+l)  = l 
then 

v = |x|-^  on  C+,  v = - \x\~S  on  C-  (1.5) 

near  x = 0 . 


We  shall  write  u = u^,  v = Vj.  if  the  fundamental  field  is  normalized  by 
(1.5);  setting  u = u , v = vr  we  shall  characterize  a generic  regular  field,  i.e. 
the  meaning  of  ur,vr  will  not  be  restricted  to  (1.4).  Let  us  now  consider  the 
mixed  energy  of  deformation  (per  unit  length )W  associated  with  UfjV^  and  ur,vr  . 
To  be  on  the  safe  side  we  exclude  the  cylindrical  domain  r < rQ  from  V.  In  the 
remaining  portion  the  mixed  energy  can  be  assumed  to  exist.  By  Betti's  theorem 
two  representations  W = Wrf,  W = W^r  of  the  mixed  energy  are  available.  Here 
Wrf  is  the  work  of  the  tractions  of  the  regular  field  through  the  displacements 
of  the  fundamental  one;  Wfr  is  the  work  of  the  tractions  of  the  fundamental 
field  through  the  displacements  of  the  regular  one.  In  either  case  the  tram  _ nr.s 
on  the  cylinder  r = rQ  must  be  taken  into  account.  We  can  write 


- W’  + W'  = W"  - W'' 
rf  fr  rf  fr 


where  primes  refer  to  the  cylinder  r = rQ  and  double  primes  to  tne  t 
V outside  that  cylinder;  the  latter  includes  B and  part  of  "+,  C". 
fundamental  field  exhibits  no  tractions  on  B,  C+,  C"  we  find  - 
ciently  small  r the  left-hand  side  of  (1.6)  can  be  evaluated  wfv 
asymptotic  relations  (1.1),  (1.2)  for  the  regular  field  and  (1.3,, 


337 


fundamental  field.  In  this  context  (l.U)  must  be  supplemented  by  formulas  for 
the  stresses  to  which  us,vs  give  rise.  Without  going  into  any  further  detail 
we  observe  that  specified  stresses  ar,a ^ and  displacements  wr,wf  of  regular  and 
fundamental  field  respectively  obey  the  order  relations 

Vr  • wf  = °(l)>  roaf  • wr  = 0(1)  as  ro  ->  o (1.7) 


on  the  cylinder  r = r0.  Since  W^.f  and  W£r  are  representable  as  line  integrals 
over  the  circle  r = rfJ  the  asymptotic  relations  determine  the  left-hand  side  of 
(1.6)  in  the  limit  rQ  -» 0.  The  final  result  of  this  procedure  is 


£ 


+ Y v„  ) ds 
r f 


(1.8) 


for  the  stress  intensity  factor  k of  the  regular  field.  X ,Y  are  the  components 
of  traction  of  that  very  field,  and  the  integration  in  (1.8)  is  over  the  line  £ 
which  bounds  the  cross  section  of  V in  the  (x,y)-plane,  ds  being  the  length 
element  of  £%  £ is  the  projection  of  B as  well  as  of  C+,  C“  onto  that  plane. 
Details  of  the  derivation  of  (1.8)  can  be  found  in  [1,2];  a different  derivation 
is  in  [3].  It  is  possible  to  extend  (1.8)  to  regular  fields  with  body  forces. 

In  the  special  case  that  the  tractions  appear  exclusively  on  C+,  C-  in  the  form 
of  a pressure  distribution  formula  (1.8)  specializes  into 


k 


m(s)p(s)ds; 


£*"  = projection  of  C+ 


(1.8') 


p = applied  pressure, 


m = 


vf  on  C 


We  call  the  displacements  Uf,Vf  weight  functions.  They  permit  to  represent  k 
as  a weighted  sum  of  the  tractions  Xr,Yr.  The  use  of  a formula  of  type  (1.8) 
for  the  computation  of  the  stress  intensity  factor  k is  advantageous  in  two 
respects: 


(1)  Uf,vf  depend  exclusively  on  the  shape  of  V;  thus  geometry  and  loading 
appear  independently  in  (1.8). 

(2)  the  effort  to  calculate  Up,vf  is  not  higher  than  the  effort  to  calculate 
the  displacements  of  some  regular  field. 


At  a specified  point  s'  the  value  of  m(s’)  in  formula  (1.8')  can  be  interpreted 
as  the  stress  intensity  factor  of  a regular  field  responding  to  concentrated 

pressure 

p(s)  = -ysL  6(  s-s ' ) 

V2 
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r 


- 
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where  5(...)  is  Dirac's  Delta  function.  For  this  reason  one  could  be  inclined 
to  classify  m(s)  as  Green's  function.  Unfortunately  the  interpretation  makes 
the  function  m(s)  an  abstract  from  infinitely  many  fields,  each  characterized 
by  a different  point  s of  load  concentration.  To  compute  m that  way  would 
sacrifice  the  advantage  (2)  which  rests  on  the  circumstance  that  m(s)  is  a 
boundary  displacement  of  only  one  field.  The  term  "weight  function"  was  chosen 
in  order  to  avoid  the  misleading  suggestions  associated  with  the  concept  of 
Green's  function. 

For  some  plane  strain  configurations  of  mode  I in  which  the  crack  faces 
alone  are  loaded  by  some  pressure  distribution  p(s)  integral  equations  have 
been  found  [2]  which  link  p( s)  to  the  crack  opening  displacement  v(s)  = vr  on  C 
in  the  form  b 


p(s)  = ^ J L(s,t)q(t)dt;  q(t)  = * v(t) 


(1.9) 


The  interval  (a,b)  is  identical  with  £+•,  L(s,t)  is  a Cauchy  type  singular 
integral  operator.  The  integral  is  taken  as  Cauchy  principal  value.  An  example 
is 


P(s) 


d f 2q(t)dt 
ds  J t - s 


(1.10) 


-1 


for  the  Griffith  crack  ( - 1 < x < 0;  y = 0)  in  an  infinite  solid.  The 
homogeneous  case  p(s)  = 0 admits  the  solution  q(t)  s 0 only  if  one  insists  that 
q(t)  be  bounded.  If  one  drops  this  condition  then  q(t)  = cm(t)  with  c as 
constant  coefficient  becomes  a solution.  For  the  Griffith  crack  the  homogeneous 
equation  admits  two  solutions  associated  with  the  crack  tips  x = 0,  x = -1, 
namely 

!/2  1/2 

m(s)  « | ±-5|  » ®(s)  = Ivbl  • (l.U) 


2.  FIELDS  IN  THREE  DIMENSIONS.  Let  us  now  generalize  the  states  of  plane 
strain  of  mode  I into  states  of  three  dimensions.  We  shall  assume  a plane 
crack  in  the  (x,y)-plaijie.  _Figure  2a  shows  an  elliptic  crack  as  example.  The 
faces  are  denoted  by  C , C and  the  contour  by  C'.  In  Figure  2b  an  infinite 
crack  occupying  the  half -plane  x < 0 is  represented.  We  shall  assume  that  the 
displacement  field  has  mirror  symmetry  with  respect  to  the  (x,y) -plane;  more 
precisely  u,v  are  to  be  the  same  at  points  (x,y,z)  and  (x,y,-z)  while  w changes 
sign  without  change  of  absolute  value.  This  is  the  generalization  of  mode  I of 
plane  strain.  Finally  we  confine  the  attention  to  those  fields  which  can  be 
derived  from  a Boussinesq-Papkovich  potential  G(x,y,z).  This  potential  is 
harmonic,  i.e. 


= 


G + G + G =0 
xx  yy  zz 


(2.1) 
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Figure  2b 


Here  and  in  what  follows  coordinate-denoting  subscripts  indicate  partial 
derivatives.  The  displacements  and  stresses  are  derived  as  follows: 

u = - zG  - (l-2v)G  , v = - zG  - (l-2v)G  , 

xz  ' x’  yz  y ’ 

w = - zG  + 2(l-v)G 


J 


t * 

u 


t* 


a = 
x 


[( zG  )„  + 2vG  ],  a = - 2u[(zG  )„  + 2vG  1,  \ 

xx / z yyj  * y L yy  z xxJ’  ] 


o — — 2p[  zG  - G I , 
z zzz  zz  J ’ 


X = - 2pzG  , x = - 2m  zG  , 
yz  yzz*  zx  xzz 

= - 2(i [ zG  + (l-2v)G  1 . 

I xyz  ' xv  J 


xy 


(2.3) 


For  the  sake  of  a first  orientation  let  us  consider  the  configuration  of  Figure  2b. 
It  admits  in  particular  states  of  plane  strain,  and  the  asymptotic  relations  of  the 
preceding  section  apply  if  the  roles  of  y and  z are  exchanged.  In  order  to  exhibit 
a more  general  class  of  states  we  set  up 


G(x,y,z)  = F(x,z)cos\y 


(2.4) 


with  seme  real  constant  \ > 0.  The  case  \ = 0 is  that  of  plane  strain.  The 
function  F(x,z)  must  satisfy 


F + F - \F  = 0 
xx  zz 


Defining  polar  coordinates  p,  * by  means  of 
i* 

pe  = x + iz 


(2.5) 


(2.6) 


we  can  rewrite  (2.5)  in  the  form 
P 


V + pF  - FA . - ^p^  = 0 . 


pp  r p <t>  <l> 

It  admits  the  product  solution 

F = F*(\p)cos  $ with 

I3/2(t)  - 3-^  • £ ^ 


(2.7) 


I0/2  is  the  modified  Bessel  function  of  type  I and  of  fractional  order  3/2  . 
Altogether  we  can  write 

G(x,y,z)  = g(x,z)h(\p)cos\y  with 

g(x,z)  = (\p)3//2  cos  = Re[\(x+  iz)]3/2  , ) (2.8) 


h(t) 


_ 3 d sinh  t 
t dt  t 


1 

•i 


t* 

\ 
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h(t)  admits  an  expansion  in  even  powers  of  t;  moreover  h(0)  = 1.  The  function 
g(x,z)  itself  is  a Boussinesq-Papkovich  potential.  Tt  describes  a state  of 
plane  strain  and  of  mode  I.  The  field  of  displacements  and  stresses  is  regular. 
The  y-axis  represents  the  edge  of  the  crack.  As  we  let  p -»0  we  approach  the 
edge.  Asymptotic  relations  of  type  (1.1)  and  (1.2)  after  exchange  of  the  roles 
of  y,z  become  valid  with  an  intensity  factor  k depending  on  y.  This  is  due  to 
the  preponderance  of  g(x,z)  in  the  representation  (2.8)  of  G(x,y,z).  Locally 
the  behavior  of  the  field  near  an  edge  point  y is  given  by  the  field  of  plane 
strain  of  g(x,z)  but  modified  by  the  factor  cos\y.  We  list  in  particular: 


k = k(y)  = k(0)cos\  y 


k /r~ 

u V?P 


^ p (k  - cos  *)cos  2 


1A  k _/T~ 

0*  ’ W Pi  , V ? P 


v = 0(p 


(k  - cos  $)sin  g * 


(2.9) 


(2.10) 


■yl—  f(  «)cOS-i  $ 

Yip  2 


where 


11  ^ 

f ( 0)  = 1 _ — sin^  * sin  0 for  0 = 0 
' ' 2 2 2 > 

11  3 

f ( 0)  = 1 + — sin-  ♦ sin  4>  for  0=0 
' 2 2 2 2 


f(4>)  = 2v 

1 3 

f(  «)  = sin  75  « cos  -2  0 


for  0=0 


for  0 = t 


(2.11) 


Furthermore  the  stresses  Txy,  fyz  stay  bounded.  The  special  potential  (2.8) 
induces  no  tractions  on  the  races  of  the  crack.  This  is  obvious  inasmuch  as 
Tzx>  Tyz  are  concerned.  As  for  0Z  we  derive  from  (2.3)  that  0Z  = 2pGzz= -2p(Gxx+ Gyy) 
on  C+,  C~.  But  G = 0 on  the  faces  and  0Z  = 0 follows.  The  displacements  are 
unbounded  as  p -»<*>.  For  this  reason  the  use  of  (2.8)  must  be  confined  to  domains 
of  bounded  p . 


Still  with  regard  to  Figure  2b  let  us  consider 

X '1 

G(x,y,z)  = Erfc(q)e  cos  y ; q = *y2p  • cos-^  0 . 


(2.12) 


The  function  G is  harmonic.  Writing  for  simplicity  Erfc(q)  = Q(q)  and  observing 
that  q as  well  as  the  product  excos  y are  harmonic  functions  we  find 


(2.13) 


v^}  = eXcos  y (V2!}  + 20  ) , 


= y”(q)(q^+qf)  = - 2qQ’(q)(q^ + qj)  = - qQ'(q)/p 


(2.14) 


Qx  = Q’(q)qx  = qQ’/2p 


(2.15) 


and  altogether  = 0 as  asserted.  The  potential  (2.12)  is  periodic  in  y with 
the  period  2n.  Inspite  of  the  factor  ex  the  potential  as  a whole  and  all  of  its 
partial  derivatives  go  to  zero  as  p -»oo.  For  small  p we  may  use 


G = eXcos  y (l  - —7^  q + O(q^) ) 

Vn 


(2.16) 


in  order  to  determine  the  asymptotic  behavior  of  displacements  and  stresses  as  we 
approach  the  edge  of  the  crack.  The  function  q can  be  taken  as  Boussinesq- 
Papkovich  potential;  as  such  it  leads  to  a state  of  plane  strain.  The  state  has 
di splacements 


u = u = 
s 


w = w = 
s 


£/ 1 

4 Vp 

WF 

4Yp 


•5  cos  | * + (k  - |)cos  i 4> 


•|  sin  | $ - (k  + |)sin  | * 


(2.17) 


A comparison  with  (1.4)  shows  that  us,ws  have  the  asymptotic  properties  of 
the  displacements  of  a fundamental  field  of  plane  strain  and  of  mode  I.  Going 
back  to  (2.l6)  we  can  expect  the  potential  q to  dominate  the  behavior  of  G in  the 
approach  p ->  0.  More  precisely  we  find 


u = a(y)ug,  w = a(y)ws  with  a(y)  — cosy 

Y* 


(2.18) 


as  asymptotic  representations  of  u,w  in  the  case  of  G. 

The  field  of  G has  vanishing  shearing  stresses  tzx,  tvz  on  the  (x,y)-plane. 


We  assert  that  oz  vanishes  on  the  faces  of  the  crack.  As  before  we  find 


a = - 2u£G  with  £G  = G +G  on  C , C 


yy 


(2.19) 


But 


G = e cos  y on  C , C 

and  dG  = 0 follows.  The  displacements  and  stresses  go  to  zero  as  p -»<*>. 
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The  potential  G(x,y,z)  of  (2.12)  gives  rise  to  other  potentials 
G(\x,  \(y-y')>  X.z)  where  \,  y'  are  real  constants  and  also  \ > 0.  These  poten- 
tials can  be  linearly  combined  in  a finite  number  of  terms,  the  combination  co- 
efficients to  be  real.  All  combinations  form  a real  linear  space  of  infinite 
dimension.  Each  potential  of  this  space  yields  a field  of  displacements  and 
stresses  which  we  now  designate  as  fundamental  field.  This  is  a generalization  of 
fundamental  fields  of  plane  strain  and  justified  by  the  asymptotic  relations  of 
type  (2.18)  as  well  as  by  the  absence  of  tractions  on  the  crack  faces. 


We  turn  next  to  Figure  2a  and  disregard  temporarily  that  the  crack  is  to  be 
elliptic.  More  generally  we  admit  as  crack  contour  C'  any  rectifiable  Jordan 
curve  of  continuous  tangent.  The  Boussinesq-Papkovich  potentials  associated 
with  this  crack  configuration  can  be  represented  as  harmonic  potentials  of  single 
layers,  more  precisely  in  the  form 


G(x,y,z)  = - p-(Y_-v') JJf(S>Tl)R~1dg dq  with 


R2  = (x-  s)2  + (y-*i)2  + z2 


(2.20) 


The  integration  is  over  one  of  the  crack  faces.  Of  the  density  function  f(|,q) 
we  assume  continuity  inside  C'  and  furthermore  for  interior  points  (j,q) 


|f(5>n)|  < f d'1/2 


(2.21) 


where  f is  some  constant  and  where  d is  the  distance  from  the  contour  C1  of 
(|,tj).  Formulas  (2.3)  lead  to 


w = 2(l-v)G  = f 


= -f  on  C 


] • 


g(x,y)  on  C+,  C" 


(2.22) 


g(x,y)  = - A |)2  + (y  - Tl)2J~  1,/2'd?  dq. 


(2.23) 


As  in  (2.19),  A stands  for  the  Laplacian  operator  of  the  (x,y)-plan  The 
function  g(x,y)  represents  a pressure  distribution  within  the  crack.  The 
stresses  tzx,  Tyz  vanish  in  the  (x,y)-plane  and  in  particular  on  the  crack.  In 
the  nontrivial  case  f(j,q)  ^ 0 we  call  G and  the  associated  field  fundamental  if 
there  are  no  tractions  on  C , C",  i.e.  if  g(x,y)  a 0.  We  call  G and  the 
associated  field  regular  if  f({,B)  satisfies  a condition  more  stringent  than 
(2.21),  namely 


|f(l,n)l  < fyil/2 


(2.24) 


where  fj_  is  a suitable  constant.  Let  s denote  the  arclength  on  C',  counted  from 
seme  point  of  C'  in  the  counterclockwise  sense  as  one  looks  down  at  the  (x,y)- 
plane.  In  the  neighborhood  of  any  point  s of  C'  we  expect  the  field  of  G to 
show  the  asymptotic  behavior  of  a field  of  plane  strain  for  an  associated  half- 
plane crack;  the  latter  must  have  the  tangent  at  s as  edge  and  must  follow  the 
inner  normal  of  C'  at  s,  In  the  case  of  a regular  field  the  asymptotic 
behavior  will  be  determined  by  a stress  intensity  factor  k = k(s).  In  this 
context  we  list  in  particular  the  asymptotic  relations 

w = k(s)  ^ on  C+  (2.25) 

-l/2 

az  = k(s)(2d)  ' for  z = 0 and  points  outside  the  crack  . (2.26) 

As  for  the  fundamental  field  we  merely  write  the  analogue  of  (2.25)  in  the  form 

w = P(s)d‘]'/2  on  C+  (2.27) 

where  the  intensity  function  f}(s)  depends  on  the  fundamental  field. 


f ! 


T* 


r% 

\ 


In  the  case  of  plane  strain  Betti's  theorem  of  reciprocity  was  applied  to 
the  mixed  energy  formed  by  a regular  and  by  a fundamental  field.  The  procedure 
led  to  formulas  (1.8),  (1.8')  for  k.  The  same  method  can  be  used  for  the  con- 
figuration of  Figure  2a  [2],  This  yields  the  analogue  of  (1.8')  in  the  form 


/ 


k( s)p( s)ds  = 


V7 


II 


M(x,y)g(x,y)dxdy 


(2.28) 


is 


The  factor 
obvious  that 


M(x,y)  is  the  normal  displacement  w of  the  fundamental  field  on  C 
g(x,y)  is  the  pressure  within  the  crack  of  the  regular  field.  It 
one  fundamental  field  does  not  permit  to  determine  the  function  k(s).  We  need 
infinitely  many  or  - for  practical  purposes  - a sufficiently  large  number  of 
linearly  independent  fundamental  fields.  In  order  to  find  such  fields  we  must 


Figure  3 


solve  the  homogeneous  integro-differential  equation 


(2.29) 


for  nontrivial  density  functions  f(|,Tj). 

Assuming  that  we  possess  a fundamental  field  for  the  crack  configuration 
of  Figure  2a  we  can  construct  a fundamental  field  for  a finite  elastic  body  with 
the  same  crack.  Figure  3 shows  a sphere  S around  the  origin.  The  elastic  body 
is  bounded  by  S and  by  the  faces  C+,  C".  In  analogy  to  the  construction  (1.4) 
we  add  to  our  fundamental  field  for  Figure  2a  a regular  field  for  Figure  3> 
such  that  the  modifying  regular  field  has  no  tractions  on  the  crack  while  its 
tractions  on  S annihilate  those  of  the  fundamental  field  of  Figure  2a.  The 
resulting  field  has  no  tractions  on  crack  faces  and  on  S;  it  displays  the 
asymptotic  behavior  of  the  initial  fundamental  field  near  the  edge  C'  of  the 
crack. 

Assume  now  a fundamental  and  a regular  field  for  Figure  3>  both  of  mode  I, 
Let  the  regular  field  be  generated  by  a distribution  of  tractions  on  S.  Under 
these  circumstances  the  analogue  of  (1.8)  is 

Jk(s)P(s)ds  = ~~  jJ(uflCr+vfYr  + wfZr)dS  (2.30) 

c s 

where  Uf,vf,wf  are  the  displacements  of  the  fundamental  field  and  Xr,Yr,Zr  the 
components  of  traction  of  the  regular  one.  The  fundamental  potential  G in  (2.12) 
can  be  used  in  order  to  construct  an  analogue  of  formula  (2.28).  Since  G has 
period  2it  one  should  apply  the  associated  fundamental  field  to  the  analysis  of 
k(y)  of  a regular  field  with  the  same  period  and  the  same  symmetry  with  respect 
to  y.  Moreover  it  will  suffice  to  consider  a slab  0 < y < jt.  Further  details 
can  be  left  to  the  reader. 

3.  PENNY -SHAPED  AND  ELLIPTIC  CRACK.  We  return  to  Figure  2a  and  interpret 
C'  as  an  ellipse  with  half-axes  a,b.  The  ellipse  has  the  equation 

E(x,y)  = 1 - x2/a2  - y2/b2  = 0 . (3.1) 

The  w-zeros  of  the  function 

2 2 2 
T(w;x,y,z)  = 1 - -f-  - -f-  - ^ 
a + u)  b + w 

define  associated  elliptic  coordinates.  The  largest  w-root  of  T = 0 will  play  an 
important  role. 
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At  this  juncture  we  turn  to  the  penny- shaped  crack  by  letting  b = a.  Without 
essential  loss  of  generality  we  assume  a = 1.  Cylindrical  coordinates  r,  0,  z 
will  be  useful.  We  have  here  x = rcos  0 and  y = rsin  0.  The  function  T takes  the 
special  form 

2 2 

m 1 I*  Z / 1 


T = 1 


1 + Ci)  U) 


(3.2) 


The  mapping  (see  also  [7l) 


■+  iz  = cosh  (s  + it)  ; s > 0,  - h*  < t < -|  it 


(3.3) 


permits  to  represent  pairs  (r,z)  by  pairs  (s,t)  in  accord  with  Figure  4. 


t > 0 


s = 0 


t = 0 


s = 0 


t < 0 


Figure  4 

The  representation  is  unique  whenever  z f 0 or  r > 1.  For  points  of  the  crack 
two  different  representations  appear  which  permit  to  distinguish  between  C+  and  C" 
From  (3.3)  it  follows  that  (3.2)  has  the  roots 


2 2 
0)^  = sinh  s , = - sin  t 


(3.4) 


The  following  relations  are  useful: 

r = cosh  s cos  t , z = sinh  s sin  t 

s = t = sinh s cos t /N,  t = - s = -cosh s sin t /N 
r z ' r z ' 

2 2 

N = sinh  s + sin  t 
^F(s)  = (F"  (s)  + tanh  sF'(s))/N  . 


(3.5) 


(3.6) 


f 


■ 


3z  = l/sint=  (l-r2)-1^2  on  C+ 
= -(1  - r2)'1/2  on  C- 


Furthermore 


2 2 2 2 2 
sinh  s < r + z < cosh  s = 1 + slnh  s 


(3.8) 


(3.9) 


In  what  follows  we  establish  an  infinite  family  of  fundamental  potentials 
without  solving  (2.29)  directly.  We  set 


Gn(x,y,z)  = Fn(r,z)  cosn(9-9')  and 
Fn(r,z)  = rnHn(s)  for  n = 0,1,2,... 


(3.10) 


Here  9'  is  a constant  which  may  depend  on  n.  We  shall  try  to  make  Gn  a funda- 
mental potential  through  a suitable  choice  of  Hn(s).  Writing  altogether 

G = rncosn(9-9')  • H (s) 
n n 


we  observe  that  the  factor  preceding  Hn  is  a harmonic  function,  i.e. 

V2 ( rncos  n ( 9 - 9 ' ) ) = A rncos  ( 9 - 0 ' ) = 0 ( 

This  in  turn  together  with  (3.6),  (3.7)  yields  after  steps  of  an  elementary 
nature 

V2Gn  = r11  cos  n ( 9 - 9' ) [h^'(  s)  + (2n  + l)tanh  s • H^(  s)]  /N  ( 

Consequently  we  must  solve 

H^'(s)  + (2n+  l)tanh  s • H^(s)  =0  ( 


(3.11) 


(3.12) 


(3.13) 


We  find 


h;(s)  Jsr 

cosh  s 


(3.14) 


with  some  constant  c.  Integration  of  H^s)  and  a special  choice  of  c yield 


H (s)  =ajn. 
n n 2 


arctan  (sinhs)  - a sinh  s 


1 

E — Hr 

k=l  2to^cosh  As  J 


(3.15) 


°o  = “^“7=  » ak  = a0(-Dk  "k^l  * 

0 ( 1 - v )y2  k 0 k 


We  leave  the  verification  of  (3.15)  to  the  reader.  Note  that  the  definition  of 
the  coefficients  is  independent  of  n! 

Having  established  the  potential  G we  check  on  some  of  its  properties. 

Due  to  (2.2)  n 

w = 2(1  - v)G  on  C+  ( 3.16) 

' nz 

Now  (3. 1*0  and  (3.8)  yield 

G = rncos n (6  - 6' )H'(0)s 
nz  v ‘ n'  ' z 

= crncos  n ( Q - 6' ) • (l  - r2)  on  C+  (3.17) 

In  the  construction  of  Hn  we  have  chosen  the  constant  c of  (3.1*0  such  that 

V2(l  - v)c  = 1 (3.1*+') 


This  choice  implies 

P = cos  n ( 6 - 8' ) 


for  the  intensity  p(s)  associated  with  Gn.  We  still  have  to  verify  that  Gn  does 
not  induce  tractions  on  the  crack.  The  nature  of  G„  makes  it  obvious  that 
Tzx>  Tyz  vanis*1  on  the  (x,y)-plane.  As  for  crz  we  observe  that 


G = rncos n ( 9 - 9' )H  (0) 
n x n 


on  C 


(3.18) 


Due  to  (2.19)  and  (3.11)  a,  vanishes.  Finally  it  can  be  established  that  the  , ,2 
field  of  Gn  has  vanishing  displacements  and  stresses  at  R = ® where  R=  (r2+  z2)  ' ; 
moreover  the  stresses  have  the  order  of  R“3  for  large  R while  the  displacements 
are  in  the  order  of  R-2.  In  this  context  we  refer  to  (3.9)  with  the  consequence 
R ~ es  and  to  (3.1*0,  (3.15)  with  the  consequence 


H (s)  = 0(e-(2n+1)s) 


(3.19) 


for  large  R.  All  of  this  is  compatible  with  the  asymptotic  behavior  for  large  R 
of  the  field  of  the  potential  of  a single  layer.  Gn  admits  a representation 
(2.20)  with  the  density  function 

f = f n = V2rncosn(6- 0')  • (l-r2)"1/2  . (3.20) 
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Formula  (2,28)  takes  the  special  form 


2n  1 


k( 0)cos n ( 8 - 0' )d0  = 


=//A‘- 


r^)-1/2C0S  n {6-9'  )g(r,0)rdrd0 


for  the  stress  intensity  factor  k = k(0)  of  the  regular  field  responding  to  the 
pressure  distribution  g = g(r ,0)  within  the  crack.  Since  C'  is  the  unit  circle 
we  are  justified  to  set  s = 0 on  C'.  It  is  obvious  that  the  formulas  (3. 21) 
for  the  various  n permit  to  calculate  the  Fourier  components  of  k (0)  and  thus 
k(0).  One  can  also  establish  the  following  formula  (see  Fi'jure  5) 


,0' )g(r,0)rdrdQ  where 


k(0')  = $//«  r,0,0' )g(r,0)rdrd0  where 
M = (l-r2)1/2^2  ; d2  = 1+r2  -2rcos(0-0') 


In  this  case  the  intensity  3(s)  is  a Dirac  delta  function  on  C'.  An  extension 
of  the  concept  of  weight  functions  in  the  nature  of  the  case  (3-22)  has  been 
suggested  by  Rice  [3]  in  general  form.  Formula  (3*22)  appears  to  be  the  first 
concrete  example  for  this  idea.  Formulas  for  the  penny-shaped  crack  of  type 
(3.21)  are  well-known  [5,6].  They  can  be  and  were  indeed  derived  by  direct 
analysis  of  the  regular  field  with  the  aid  of  Fourier-Hankel  transforms.  But, 
as  we  have  just  shown,  the  concepts  of  fundamental  fields  and  weight  functions 
permit  to  establish  such  formulas  in  a simpler  and  yet  more  systematic  way. 


Figure  5 


We  conclude  the  discussion  of  the  penny-shaped  crack  with  some  formulas  for  the 
functions  Fn(r,z)  in  the  composition  (3.10).  We  list  without  proof 


+ — F = nF  + DF  . ; (D-n)F  = F , F 

r n n-1  n-1*  n n-l,r  r n- 


2 

:2F  - (-)  F = 0;  f D(D+ l)-n(n  + 1)1f  +F  = 0 
n r n ’ l ' ' v '\  n n,zz 


The  operator  D preserves  harmonicity.  The  third  relation  expresses  the  harmoni- 

city  of  Gn  without  reference  to  the  9 - term.  It  is  possible  to  establish  the 

formulas  of  the  first  line  by  merely  using  az  ~ 0 and  the  asymptotic  behavior  of 

w on  the  crack  in  the  case  of  any  G . 

J n 

Returning  to  the  general  elliptic  crack  we  make  an  extensive  use  of  available 
literature  [7  - ll].  In  particular  Dyson’s  formulas  [8,9]  will  be  applied. 
Following  Dyson  we  write  the  density  function  f in  (2.20)  in  the  form 


f(x,y)  = - 4n( l-v)h(x,y)E‘ 


X-l/2 


(x,y) 


(3.24) 


where  E is  the  function  in  (3-1).  We  are  primarily  interested  in  the  cases 
X = 0 and  X = 1;  h(x,y)  is  to  be  a polynomial  in  x and  y.  Under  these  circum- 
stances the  case  X = 1 will  yield  a regular  potential  and  the  case  X = 0 a 
fundamental  one  for  properly  chosen  h(x,y).  Dyson  himself  admits  more  general 
h(x,y).  The  case  X = 0 is  pertinent  to  the  analysis  of  an  electrically  charged 
disk;  so  far  it  has  not  been  applied  to  elastic  analysis.  We  introduce  the 
following  denotations  and  symbols: 


Q(s)  = s(a"  + s)(b2+  s);  q(s)  = Q1//2(s)  > 0 
for  s > 0 


(3.25) 


2 ’ 
b + s 


psx2  *ay2 


(3.26) 


We  denote  the  largest  w-root  of  T = 0 by  t;  it  is  nonnegative.  These  symbols 
and  denotations  are  unrelated  to  formerly  defined  quantities.  Dyson  and 
Hobson  have  shown  that  the  potential  G of  (2.20)  can  be  rewritten  as  a single 
integral,  » 


xabT(x  + g)  r TX 

G(x,y,z)  = — = / «t-t-  W h(px,qy)  • ds  , (to  » s) 

r(|)xr(x)  ( q(s)  * 


(3.27) 


r 


<•1 


i 

i i 

V 

* 

J 


f’ 


) 


where  is  the  following  differential  operator: 


iW 


n=o  Unnl  (x  + l)(x  + 2) . . . (x  + n) 


(3.27') 


The  symbol  r denotes  the  Gamma  function.  We  replace  XT(x)  by  unity  for  X = 0. 
Since  h(x,y)  is  a polynomial  only  a finite  number  of  terms  in  (3*27)  have  to  be 
used.  M^h(px,qy)  is  therefore  a polynomial  in  x,y  whose  coefficients  are 
functions  of  s.  Let  us  now  consider  G on  the  crack;  in  this  case  t = 0 and  thus 


nabT(\  + -5) 


Ttabr(X+pJ  r X + 

G(x,y,0)  = — = / =r-r  M.h(px,qy)ds  valid  for  C , C"  . (3.28) 

r(§)xr(x)  J q(s;  ^ 


In  the  cases  X = 0,  X = 1 the  function  is  a polynomial  in  x,y.  Altogether 
we  see  now  that  G(x,y,0)  is  a polynomial  in  x,y  on  the  crack,  and  so  is 
- a = A2uG(x,y,d)  = g(x,y).  We  can  write 


g = ^h 


(3.29) 


where  denotes  a linear  operator  which  transform  the  polynomial  h into  a 
polynomial  g.  The  nature  of  the  mapping  depends  on  X . 

Case  X = 1 

This  is  the  case  of  ordinary  elasticity.  maps  the  real  linear  space  of 
all  polynomials  of  degree  < m (real  coefficients)  into  itself,  jf-^h  = 0 for  some 
h ^ 0 cannot  happen.  The  mapping  is  therefore  1-1;  given  g there  is  a unique  h. 
The  mapping  does  not  necessarily  transform  homogeneous  polynomials  into 
homogeneous  ones. 


Case  X = 0 

If  h has  degree  m then  g = ^Qh  has  degree  not  higher  than  m-2.  The  case 
^Qh  = 0 for  h f 0 can  happen.  We  call  such  an  h a fundamental  polynomial.  It 
leads  to  a fundamental  field  G.  Trivial  cases  are:  h = 1,  h = x,  h = y,  h = xy. 

Here  the  reader  is  reminded  of  the  definition  of  the  degree  of  a polynomial  h(x,y). 
If  h(x,y)  is  a monome,  i.e.  h - cxmyn  with  c f 0 then  the  degree  of  h is  m+n.  If 
h is  a combination  of  monomes  we  look  for  the  monome  of  highest  degree;  that 
degree  is  taken  as  degree  of  h.  A polynomial  is  homogeneous  if  all  of  its  monomes 
have  the  same  degree.  The  values  of  h on  E = 0 are  given  by  the  Fourier  poly- 
nomial h(acos  9,  bsin  6);  the  latter  has  degree  < N if  h has  degree  N.  Note  that 
E(acos  9,  bsin  9)  m.  OJ 

For  each  degree  m > 2 two  fundamental  polynomials  h(x,y)  of  degree  m can  be 
constructed  as  follows:  Set 
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(3.30) 


h(  x,y)  = xm  + h^(x,y)E(x,y)  where  = - /qX™ 

Since  ^oxm  has  degree  < m-2  the  polynomial  h^  is  of  degree  < m-2;  consequently 

h^E  has  degree  < m,  and  the  degree  of  h cannot  exceed  m.  But  h = xm  = a cos  t) 

on  E = 0;  this  is  a Fourier  polynomial  of  degree  m.  The  degree  of  h(x,y)  cannot 

be  less.  Thus  h is  seen  to  have  exact  degree  m.  Now  h^E  and  \ = 0,  h^  and  \ = 1 

define  the  same  potential.  Hence 

^(h-jE)  = jf1h1  and  jtQh  = 0 . (3.31) 

This  establishes  h as  a fundamental  polynomial  of  degree  m.  In  the  same  vein 
we  construct 

h(x,y)  = xm_1y  + h2(x,y)E(x,y)  where  £^h2  = - *oxm-1y  . (3.32) 

With  (3.30),  (3.32)  we  have  obtained  two  linearly  independent  fundamental  poly- 
nomials of  degree  ra. 

The  application  of  the  operators  iQ,  £■,  involves  certain  elliptic  integrals. 
The  following  coefficients  are  needed  for  trie  construction  of  fundamental  poly- 
nomials: 


c(£> 

mn 


00 

f -1/2  + / m+ 1/2  n+ 1/2.  (0) 

= J 3 ' P ' * ds;  cmn  = 


ran 


(3.33) 


m,n,/  run  through  the  nonnegative  integers.  The  coefficients  satisfy  the  recur- 
sions 


(2m+l)c  - (2m+2n+l)c  + (2n+l)c  . = 0 

' m+l,n  mn  m,n+l 

2 2 2 

c = tc  . + (1-t  )c  . with  t = a /(a  -b  ) 

mn  m-l,n  m,n-l  ' 


(3.34) 


Up  to  degree  3 fundamental  polynomials  can  be  homogeneous.  This  is  no  longer  so 
from  degree  four  on.  We  give  some  polynomials  below: 

Fundamental  polynomials  up  to  degree  4 


m = 0:  h = 1 

m = l:  h = x,  h = y 

m = 2:  h - cq1x2  - c^y2. 


h = xy 
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THE  BUCKLING  PRESSURE  OF  AN  ELASTIC  PLATE  FLOATING 
ON  WATER  AND  STRESSED  UNIFORMLY  ALONG  THE  PERIPHERY 


OF  AN  INTERNAL  HOLE 


Shunsuke  Takagi 
Corps  of  Engineers 

U.S.  A rmy  Cold  Regions  Research  and  Engineering  Laboratory 
Hanover,  New  Hampshire 


INTRODUCTION 

To  test  the  strength  of  an  ice  sheet  floating  on  water  the 
following  measurement  is  regularly  performed  (Zabilanski  et  al.,  l): 

Dig  a hole,  place  a vertical  pile  of  various  shapes  and  push  it  breaking 
through  the  ice.  However,  the  mechanism  of  the  failure  is  not  yet 
clarified,  and  the  interpretation  of  the  data  is  not  yet  satisfactory. 

To  understand  the  basic  mechanism,  an  ideally  simple  case  is  chosen  and 
analyzed  in  this  paper. 

A paper  of  the  same  title  was  presented  at  the  20th  Conference  of 
Army  Mathematicians  (1971*).  When  numerical  work  was  attempted  in  the 
summer  of  1975 , it  was  found  that  the  analysis  presented  in  the  20th 
Conference  did  not  work  as  expected.  A new  analysis  as  reported  in  this 
paper  was  developed,  and  the  numerical  computation  was  carried  out. 


t 


1.  The  Problem 


Suppose  a thin  elastic  plate  floating  on  water,  extending  horizon- 
tally to  infinity,  and  stressed  with  uniform  horizontal  pressure  along 
the  periphery  of  an  internal  circular  hole.  We  are  interested  in  formu- 
lating the  buckling  pressure  and  the  deformation  at  the  failure. 

The  vertical  deflection  w of  an  elastic  plate  that  rests  on  a liquid 
and  is  subjected  to  a vertical  load  q and  the  horizontal  stress  of 
components  77^,  77  , and  , is  governed  by  the  differential  equation. 


2 2 2 

DV*w  + yw  = q + 77  ^ + 277  -Mr-  + A'  — (l.l) 

' xx  „__2  xy  3x3zy  yy  2 K 


dx 


where  D is  the  flexural  rigidity  and  y the  specific  weight  of  the 
liquid  (Ref.  2).  Let  r be  the  radial  distance  from  the  center  of  the 
hole.  In  our  problem  q = o and  the  deformation  is  cylindrically  sym- 
metric around  the  center  of  the  hole.  Then  (l.l)  becomes 


7 k/d2  A I d \ 2 , ^ , _ 1 d2w  ^ 1 diA 

° \dr2  r y \rr  dr2  ee  r dry 

where  l - (D/y)^^  is  the  characteristic  length,  and  77  and  77  are  the 

O TV  uu 

radial  and  hoop  horizontal  stresses  in  the  plate  (see  Appendix  B). 

Following  the  usual  treatment  (Ref.  2),  we  assume  that  the  horizontal 

stress  components  77  77  , 77  are  in  equilibrium  by  themselves.  Then 

xx  xy3  yy 

they  are  derived  from  a biharmonic  function  <p  by 
a2d> 

XX  2 


77 


yy 


32<j) 

3x2 
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xy 
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dxdy 


In  the  general  polar  coordinates  they  are: 
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ee 


In  our  problem  if  is  a function  of  r only  and  must  tend  to  zero  when 
r becomes  infinite.  Then  they  are  formulated  as 

N = - Ar~2 


rr 


N a = 0 

r9 


"ee  = 4r 


-2 


where  /!  is  a constant.  Constant  A is  positive  because  is  pressure. 
Instead  of  A we  introduce  nondimensional  constant  a and  express  the 
stress  components  as 


rr 


7h  -2 
-ay  l r 
o 


„ 7U  -2 

"ee  = aylo  r 


Introduce  the  nondimensional  length  x , 


(1.3) 


(1-U) 
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In  this  way  (1.2)  becomes 


d w 


dr 


2 

X dx3 


1-g  d^w 

2 2 
r dr 


At  x = <*>,  the  condition 
u = 0 


« - 0 

du 


1-g  du 
3 dr 


+ w 


(1.5) 


(1.6) 


must  be  satisfied.  At  x - x , where  x is  the  value  of  x at  the 

o o 

periphery  of  the  internal  hole,  we  consider  three  conditions:  (l)  the 

clamped-edge  condition 


w = 0 
du 

d?  = 0 


2)  the  simple-edge  condition 


W = 0 

djJ  + 
dx2 


v dzJ 
x dx 


= 0 


(1.7) 


(1.8) 


and  (3)  the  free-edge  condition 


d jj 

dx2 


+ 

x dr 


= 0 


dw 


d_  | dfw  + 1 
dr  \ ^2  x dr 


a dw ' 


- + is r 0 


(1.9) 


where  v is  Poisson's  ratio. 

The  second  equation  of  (1.8)  and  the  first  equation  of  (1.9)  are 
found  from  M = 0.  The  second  equation  of  (1-9)  is  derived  from 
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Q + (l/r)(3M  „/30 ) = 0.  The  effect  of  horizontal  stress  must  be 
r re 

counted  in  Q . In  the  rectangular  coordinates  x,  y,  shears  Q and  Q 


are  given  by 


m 


Q_ 


xx 


3x 


xy 


+ N 


3 M 3 M 

Q = — &L  + —WL 

y 9*  3 y 


xx  3x 

3u 


N |^ 
xy  3 y 


+ N — + N 

xy  3x  yy  3 y 
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(1.10) 


These  equations  are  found  by  extending  Hitenyi's  (3)  one-dimensional 
treatment  to  two-dimensional.  In  polar  coordinates  r,  0,  components  of 
shear  and  Q are  given  (see  Appendix  A)  by 

P D 
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3 M 

+ i 

3r  r 


3 M 


M + 
I rr 


r6 


30 


- M 


00 


+ — N + — — N 
3r  rr  r 30  r0 


3r  r r0  r 30 


+ — N + — — N 

dr  rd  r 30  00 


(1.11) 


Constant  a is  the  eigenvalue  to  be  determined  to  satisfy  the  boundary 
conditions  at  x = x^.  The  first  step  for  the  solution  of  this  eigen- 
value problem  is  to  discover,  given  a positive  number  a,  two  real 
functions,  u^(x)  and  W^[x),  that  are  the  solutions  of  the  differential 
equation  (1-5)  and  meet  the  boundary  conditions  at  x = 00  in  (1.6)  but 
are  not  restricted  at  r = in  any  way.  We  call  them  the  fundamental 
solutions . We  shall  find  them  later  in  the  following  form. 


to 

-/Ua  / 2 

/l^a  12 

^ + iw2  = I 

•'l 

(2  h \ 

(2  U \ 

+ 

1 

h-J 

+ ^r  + r -l) 

-1+1 

xr  -1_ 

/2  U , 2, 

e r -1  dr 


(1.12) 
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The  second  step  is  to  express  the  fundamental  solutions  as  power 


series  of  x.  Let  / {x)  (n  = 0,  1,  2,  3)  be  the  Fuchsian  type  solu- 

tions of  (1-5)  relative  to  x = 0.  Vie  shall  find  linear  combinations, 
3 


w.(x)  = ^ A,  f ( 

k / ten  n 


(1-13) 


by  determining  constants  A1  by  use  of  (1.12).  The  solution  w{x)  is 


a linear  combination  of  the  fundamental  solutions, 
u(x)  = A ^(x)  + B w^{x) 


(1-1** ) 


The  third  step  is  to  solve  the  simultaneous  equations  of  A and  B 
that  are  found  by  substituting  (l.lU)  into  the  boundary  conditions 
(1.7),  (1.8),  or  (1-9)  at  x = xq.  If  a root  of  the  algebraic  equation 
found  by  letting  the  determinant  of  the  simultaneous  equations  equal  to 
zero  is  positive,  the  root  gives  x . Our  problem  is  then  solved, 
la.  Abstract  of  the  result. 

The  main  feature  of  the  numerical  result  is  as  follows: 

1.  Buckling  takes  place  under  the  free-edge  condition.  Buckling 
does  not  take  place  under  the  clamped-edge  and  the  simple-edge  conditions. 

2.  Eigenvalue  a under  the  free-edge  condition  is  found  in  the 
2 

range  1-v  <_  a < 00 , where  v is  Poisson's  ratio  of  the  elastic  ice  plate. 

2 

When  a = 1-v  , root  x is  equal  to  zero.  Analysis  presented  here  is 

c 

2 

complete  for  the  case  1-v  < a < 2,  but  not  complete  for  the  case  2 < a < 00 . 

It  is  believed  that  the  result  presented  in  this  paper  can  practically 
cover  all  the  cases  of  our  interest. 

3.  Buckling  under  the  free-edge  condition  takes  the  shape  as  shown 

2 

in  Figure  5 and  6.  ( a is  restricted  to  1-v  £ a <_  2).  This  shape  of 


a 


deformation  is  observed  frequently  in  laboratory  experiments  and  field 
tests.  Therefore  we  may  conclude  that  buckling  is  an  important  mechanism 
of  failure. 


•.  I 


« 
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I « 
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PART  I.  FUNDAMENTAL  SOLUTIONS 
2.  Fuchsian  Type  Solutions 

Equation  (1.5)  has  a regular  singularity  at  x = 0.  The  solutions 
relative  to  x = 0 are  the  Fuchsian  type  power  series  of  x.  Their 
indie ial  numbers  are: 


v = 0 

o 


V1  = 2 


V2  = 1 + u 


V3  = 1 - u 


where 


v = *T-a  (2.1) 

The  four  solutions  may  be  expressed  with  a single  formula 


fjx)  - T)  * 

m n 


(m)  m + hn 


(2.2) 


n=o 


where  m = 0,  1,  2,  3,  and 

M 


a ' = 1 


(m) . 


and  the  rest  of  the  coefficients  (n  > l)  are  determined  by  the 


recurrence  formula. 


|(v  + lm)(v  + lm-2)(v  + lw-l-u)(v  + Un-1  +v) 

n n-1  | mm  m m 


-1 


(2.3) 


Their  individual  forms  are: 

CO 

n 


foU)  = 2 


“ ^(2 n)l  r /n+^( 3-u )j V |n+^( 3+u )) 


(-1) 


(-1) 


■'(r*5-"1) 

r< 

5+ij)) 

f(n+i(5-u)) 

r! 

| («•*-£(  5+h)^ 

hn+2 


Wx> 


(2.M 


(2.5) 


y (-1)"  r(i(3-|l))  r(i(^")) 

U^n!  r^?+|  (2+u)j  r(n+^(3+w)i  r^j+J(  5+w)J~” 


lm+1+U 


(2.6) 


where  k=l,  2.  In  (2.6)  we  have  introduced  the  convention  that  the  upper 
or  lower  of  the  double  sign  + (or  +)  should  be  taken  according  to  k = 1 
or  2,  respectively.  This  convention  is  observed  throughout  the  paper. 
The  main  objective  of  PART  I is  to  determine  the  fundamental  solution, 
i.e.  to  determine  A ^ in  (1.13). 

Differential  equation  (1.5)  has  an  irregular  singularity  at  x = °°. 
In  other  words,  the  solution  relative  to  x = =>,  say  f (x) , can  be  found 
in  the  form 


fix)  = pn  x~  2 ~n 

n=o 

i 08 

||  ^r-“*  “~Yl 

where  A satisfies  A +1=0.  The  series  P x in  this  equation 

n=o  n 

is  asymptotic  and  divergent  in  this  case.  Therefore  this  equation  does 
not  provide  any  means  for  determining  in  (1.1$). 


3.  Countour  Integral  Solution 

In  order  to  find  the  fundamental  solutions,  (1.5)  must  be  transformed 


by  means  of  the  contour  integral. 


% >'■ 


/. 


( G ) e Q 


(3.1) 


where  L is  a contour  in  the  complex  plane  of  £ that  shall  be  determined 
to  let  a solution  of  (1.5)  satisfy  the  boundary  condition  (1.6)  at  x = 00 . 
Following  the  usual  procedure  (ince  (U),  pp.  187-188),  one  arrives  at 


the  differential  equation  of  v (5), 


(mS  ~ + 10 c3  ~ + (23 +<z)c2  + (9+3akv  = 0 (3.2) 

dc3  dC2 


The  contour  L selected  for  this  solution  is  shown  in  Figure  1. 
To  find  the  solution  relative  to  C = let 


where 


C = Sr 


6 = exp(3ni/k). 


(3.3) 

(3.U) 


Then  the  equation  (3-2)  becomes 

(r**-l)  + lOr3  ~ + (23 +a)  r2  + (9+3 a)rv  = 0 

dr3  dr2  ^ 

(3.5) 

This  equation  has  a regular  singular  point  at  r = <*>.  The  indicial 

numbers  X (m  = 0,  1,  2)  at  r = ® are: 
m 

Xo  = 3 

A1  = 2 + u 

*2  = 2 - u 

where  y is  given  by  (2.1).  The  solution  v (r)  corresnonding  to  the  indi- 

m 

cial  number  X is: 
m 


v (r)  = V 

m 

nr  o 


(m)  -X 
q r m 

• v? 


(3.6) 


I 


r» 


t* 


where 


n-1 


<i{nw)  = n (^p+V(l4p+V2) 


( lp+A^+2-p  ) ( lp+A^+2+p ) 


-1 


(3-7) 


P=  0 


The  contour  L must  be  such  that  the  point  C = 8 is  a branch  point 
of  y(c).  This  condition  is  satisfied  by  the  series  u^(r)  and  u^(r),  as 
shown  by  (3.10)  below.  These  two  series  can  be  expressed  by  means  of 
hypergeometric  series  F ( 

i.  \ 

(3.8) 

The  hypergeometric  series  are  summed  up  by  use  of  the  formula 

l-2a 


y^,(r)  = r 2+M  F ^(2+u)A,  ( U ) / 1+ ; (2+u)/2;  . 


F(a,  ^ + a-,  2a;  z)  = 2 


2a-l 


(1-z) 


1 + /1-Z 


(3-9) 


[Handbook  (Ref.  5)  p.  556,  Formula  (15.1.A1 


Thus  V^(r),  where 


k = 1 or  2,  reduces  to 

-1 

vk(r)  = (rU-l)  2 | ( 1/2) (r2  + 


nr 

Jr- 


i) 


+li/2 


(3.10) 


This  equation  shows  that  C = 6 is  a branch  point.  Formula  (3-9)  can 

be  proved  by  showing  that  the  one  or.  the  right-hand  side  satisfies  the 

hypergeometric  differential  equation  of  the  one  on  the  left-hand  side 

and  also  that  they  satisfy  the  same  initial  conditions  at  z - 0. 

Suppose  that  y.^(r)  on  one  of  the  branch  A-B  of  L in  Figure  1 is 

given  by  (3.10).  Then,  uj,(r)  on  the  other  "branch  ^ is  given 

+u/2 


I ^ 

i { 

-(rU-l)  2 [(1/2) (r^r1*-!)] 

Thus  one  finds  the  integral  solution, 
F(x)  = 


jf  (r2  +|7^l)  2 + (r2+J7^l) 


6a-r  / b . 2 , 
e ( r -1 ) dr 


(3-11) 
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The  fundamental  solutions  U^(x)  and  W^[x)  are  given  by  the  real  and 
imaginary  parts  of  F(x) , 

F{x)  = u^x)  + i x2(x)  (3.12) 


U . Integration  of  the  Integral  Solution 

We  shall  integrate  (3-ii)  to  a linear  combination  of  fQ(x),  f^(x), 
fn(x)  and  f.j{x).  The  first  step  for  this  goal  is  to  change  the  range 
of  integration  in  (3-11).  Introduce  a complex  variable  2 = fir,  where 
3 is  given  by  (3.U).  Use  of  2 transforms  (3.1l)  to 


F'(x) 


zx 

e 


B_1(-2U-1) 


(U.l) 


The  range  of  integration  3 00  3 in  (U.l)  shall  be  changed  to  3 ^ 0 

and  0 'v  — 00 . Thus  (U.l)  becomes 


F(ar) 


zx 

e 


i_13_1(2i*+l) 


(U.2) 


In  the  above  equations,  quantities  inside  the  square  roots  are  chosen  to 
be  positive  in  order  to  insure  correct  forms  in  the  respective  ranges. 
Letting  2 = Br  in  the  first  integral  and  2 = -r  in  the  second  integral, 
(U.2)  transforms  to  a summation  of  normal  forms  of  integration. 


F(x)  = ih  {x)  + ih2(x)  - ®exp(-uni/U)  g^x)  - Bexp(piri/U)  g ^ (sc) 

(U.3) 

1 + a- 

where  f , . 2 , 2 

h^(x)  = I yr  + i/l-r  j e^’fl-r  ) dr  (U.U) 


fo  |h  ro|H 


r 


-XT  , U . 2 , 

e (r  +1)  dr 


(>*.5) 


Expansion  of  exp(Bxr)  transforms  (k.k)  to  power  series  of  x, 

hk{x)  = h{ok)  +ex  h + |r  (e®)2^k)  + ...  (U.6) 


where 


h (k) 

n = 


£ (r2  . i/^5) 


n,^  U s.  2 
r ( 1-r  ) or 


(U.7) 


Integration  of  (b.j)  will  he  carried  out  later.  Integral  (1*.5)  trans- 


forms to  power  series  of  x 

ff±(x)  = 0O(1)  + + gj'1'1*  1+v  + g2  x2  + 


(>4.8) 


(2)  A „ (2)  1-u  ^ _ (2). 


92(*)  = *0l  / + V * +*i  * + * 


(2)  2 


X + . . . 


(>4.9) 


as  explained  in  the  next  section.  Thus  one  finds  F(x)  in  the  following 
form. 


F(x)  = B + B^x1^  + B x + S(l)x1+,J  + Bx2  + ... 
o u 1 y 2 


where 


B = i(h^  + J^2')  - a(e 

o \ o o J \ 

B1  = i6(^l1)  + ^l^)  - e ( 


ym  ynt  \ 

' It  (1)  >4  (2)1 

6 So  + 6 do  ' 


- UTIt  f n\\ 

>4  (1)  >4  (2) 

; 9,  + e g.  ) 


(>4.10) 


(>4.11) 


(>*•12) 


•a  ■ §('41,*4l-4'^41 

- UTT  j 

B = - Be 

M yU 


> + 


(>4.13) 


(>*.!>*) 


In  this  calculation  we  tentatively  assume  that  0 < a < 1.  Series  are  ar- 
ranged in  the  ascending  order  on  this  assumption.  The  formulas  for  the 


3C9 


values  of  a outside  the  range  0 < a < 1 will  be  derived  from  the  formulas 

in  this  range.  Note  that  the  entries  in  (l*.10),  except  B^x,  are  the  firs* 

terms  of  f^{x),  where  m = 0,  1,  2,  3-  The  first-order  term,  B^x,  is  not 

contained  in  any  cf  jMx) ; it  is  proved  later  that  B^  = 0.  The  entries 

in  (k.10)  are  sufficient,  to  express  w^(x)  as  a linear  combinations  of 

/ (x) . 
m 

ba.  Formulas  of 


(k) 

We  shall  give  the  integral  forms  of  g by  successively  developing 
(^•5)  into  series  of  x.  Integration  of  these  formulas  will.be  carried 
out  in  the  next  section. 

(i.\ 

Letting  x = 0 in  one  finds  g' 


(r2  + ,/r+l)  2 (r^+l)  2 dr 


(Ua.i) 


To  find  g ^ ' , the  formula 


- g\ 


- f <r2  if\l 


<r2  t/Al)2  (c-ra  -i)  (Alf  2 or 


shall  be  transformed  by  introducing  £ = rx  to 


co 

2'7 


U2+Uh+xh)2  (e  C-l)  (7+xS  2 dC 


(ba.2) 


Letting  x = 0 inside  the  integral,  one  finds 


9(?)  - * 


KU~2  (e  C-l)  d£ 


( . 3 ) 


/ 2 \ • ^ 

To  find  g ^ , multiply  x u on  (**a.3)  and  subtract  it  from  (ka.2). 


Thus  one  finds 
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92M  - 9{02)-  g{2)  x1'* 


oo 


Lvr  * «s. 


1 v \ 

o2  rV-2]  , -C 


(?V)  ‘ U‘+  C+xHr  -2*  dl 


Letting  £ = roc,  this  transforms  to 


/-rx 

<j>..(r)r  — dr 

Y1  rx 


( Ua.  I4 ) 


where 


h , 

+ r +1 


1 • 1 u 

2^  , U 2 „2  u-2 


(r)  = (r  +|r  +1)  (r  +1)  - 2 r 


(Ua.5) 


Because  of  the  inequality  1 > (l  - e U)/u  >.  1 - u/2  the  integrand  of  (La.i*) 
is  uniformly  bounded.  One  can,  therefore,  let  x -*  0 inside  the  integral. 


,(2)  „ 


-h 


(r)  r dr 


(J*a.6) 


To  find  » multiply  x on  (ka.6)  and  subtract  it  from  (ba.U).  Thus 

one  finds 

1 s (2)  (2)  1-y  (2) 

9 2<«)  " 90  ~ 9^  x - g1  x 


a 

= x2  J 


, 2 e_Xr  - 1 + rx 

) r 2~2 ^ 

r x 


Because  of  the  inequality  0 >.  (l-u-e  U)/u^  >_  - ~ the  integrand  of  the 
last  integral  is  uniformly  bounded.  One  can,  therefore,  let  x 0 inside 
the  integral.  Thus 


2 J 


r)  r dr 


(fca.7) 


To  find  <7^  , one  raay  simply  differentiate  g^{x)  in  (^.5)  with 
regard  to  x and  let  x = 0 in  the  result.  Thus  one  finds 


g = -J"  (r2  + Jr1*  + l)  2 [r/lr^+l)  2]  dr 


( Ua. 8) 


To  find  g^\  use  (^a.l)  and  (ka.8)  to  derive  the  formula 


g^x)  - g^  - g^x  = y"  (r2  +J  r^  + l)  2 (e  rX  - 1 + rx)  (r^+l)  2 dr 
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>.„r  2 


Letting  C = rx,  this  becomes 
«”  1 


= X 


,1+U 


I 


U^+xS  2 (t,2+Uk+xh)  2 (e~C-l+£)  <1C 


(Ua.9) 


Letting  x = 0 inside  the  integral,  one  finds 
0 2 / ^-u-2  (e"S  _ x + e) 


I 


.(1) 


(Ua.10) 


To  find  , use  (Ua.9)  and  (ta.lO)  to  derive  the  formula 

, , (1)  (1)  (l)  l+u 

g i(*)  - 0O  - x - 9V  x 


03 

_ 1 

u 

V 

= x1+y  f 

(i^)  2 

(£,2-Jt,U+xU)  2 

-2  2 ry-2 

(e'C-l+C)  de 


Letting  f;  = rx,  this  transforms  to 
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= x 


I 


-rx 

/ n 2 e - 1 + rx 

<f»2(r)  r 2~2 & 

r^x 


( ^a.ll) 


where 
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Ar)  = (r^+l)  2 (r2+/r  +1)  2 -2  2 r u~2 


Because  the  integrand  of  (^a.ll)  is  uniformly  bounded,  one  can  let  x -*■  0 


inside  the  integral.  Thus  one  finds 


$2(r)  r dr 


(lja.12) 


l*b.  Evaluation  of  a 


The  independent  variable  n introduced  by 


21k  , -1/2 

r + / r +1  = n 

is  useful  for  the  following  integrations.  This  transforms  to 
= (1  - n)/(2n1/2) 

J 1 + = (1  + n)/(2n1/2) 


(Ub.l) 


r dr  = -[(1  + n)/(UnJ/  dn 
Substituting  these  in  (1+a.l),  one  finds 

9<H  . ,^,-4  4 , if) 

- -®'1  r(HJL)r(H -)  -i» 

Eq.  (l)a.8)  is  similarly  integrated  to 

41  - v1 


Use  of  n transforms  (lja.6)  to 

„<2>  = I C M 


-if  - 


2 _1]  - f -1 

n 


(1  + n)(i  - n) 


(**b.2) 


(*4b.3) 


By  letting  1 + n = 2 - (l-n),  the  last  integral  is  divided  in  two  por- 


tions; thus  g^  becomes 
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i J 1 J 

h J1  2 J2 


where 

r r -1  - - -l 

J1  = f 1 - (1  - n)2  n ^ dp 

* ° 

and  the  remaining  integral  <J  g simply  integrates  to 

J2  = - B(v/ 2,  1 - pA) 

- - -1 

After  partial  integration  of  n U , integrates  to 
= -H/p  + 2B( p/2,  1 - p/U) 

Thus  one  finds 

g[2)  = p"1  AbA) 

Combining  (Ub.3)  and  (Hb.U)  the  result  may  be  shown  with  a single  formula. 


Jk)  - z .-1 

9±  = + u 


Ab.5) 


After  partial  integration  of  l ’,v  ( ba. - 3 ) integrates;  after  two  times 

— u— 2 

of  partial  integrations  of  £ , Aa.10)  integrates;  the  two  results 

are  shown  here  with  a single  formula: 


gW  = ± 2+p/2  Iy(l±u)]-1  r(l+p.) 


Ab.6) 


We  express  g ^ in  (Ua.7)  and  in  (ta.12)  with  a single  formula. 


oo 

■i/ 


(Alf  2 <r2  * Al)*"/2  -2;“/2  r-2*“  r2  dr 


Use  of  n changes  this  to 


5 + v 


U (1  - n)1/2  dn 


T 


Letting  l+n=2-(l-n)»  this  can  he  divided  in  tvo  integrals, 


g{2k)  = (8/2)  + (l*/2)  Kr 


where 


1 . V 


^ -f  n'  11  1 h (l-n)1/2  d„ 

•'o 

and  the  remaining  integral  simply  integrates  to 


*2  = -E 


+ v 3 - tA 
2 ’ k ) 


_ 5 + u 

After  the  partial  integration  of  n U - U , integrates  to 

- ^4  I * (I  • ■a4ji)  -<-»>  *(4*.  4*)] 


Thus  one  finds 


- 4-4*) 


he.  Integration  of  h 


fltb.7) 


2 ■ L k 

r + z 1 - r = c, 


(hc.l) 


r2  = (1  + c2)/(2c) 


1 - r1'  = z (1  - c)2/(24 


rdr  = (c2  - l)/(hz,2)  dc 


n-1 

2 


1 


4k>  - - - r(J . 


irt(-l+g)\  . Ti(l+g) 
-in  r — 


Thus  one  finds 


^U)+^(?)_  ? Ml)  M2) 

o o 
and 


M .I<2V  ( 


gn  . , 

ccs  — -t) 


41’  * 42) 


= t2  2 (i*1*  +i^2j  + M [M  • ; rc^-jr^-)^  - j 


fi 

w 


! 

u 

‘i 

f 


Taking  the 

real  parts 

, one  finds 

fc(1> 

+ ;:(2)  - 

(2/2*  r1 

o 

o 

and 

II 

c\i 

'w'  c 

+ 

(l-g2)"1  f 

’2  • ‘2  " ' V Tt  ( !*  ) ( ^ ) C°s  2 

4d.  Fundamental  solutions  for  0 < a < 1. 

Substituting  (4b. 2)  and  ( h.  c . U ) into  (U.ll),  one  getr 

Bq  = (2/2^)_1  r(^)  F(^p) 

Substituting  (4b. 5)  and  (4c. 3)  into  (4.12),  one  gets 
B1  = 0 

Substituting  (4b. 7)  and  (4c. 5)  into  (4.13),  one  gets 
& (l-g2)]"1 

J 

Substituting  (4b. 6)  into  (4.l4),  one  gets 

-1 


(4c. 4) 


(4c. 5) 


B2--  i 


<T )r(V 


b<'k>  = + 

g 


g(l-g2) 


g_ 

+ 2 


-,/„t  \ (3+g)nt 

! ( 2+p ) exp  — £ 


Thus  one  finds 
F{x) 


Bo^ix>  + y2(x>  + 4M(x>  * 4s)f3M 


(4d.l) 


(4d.2) 


( 4d. 3) 


(4dJ 


(4d.5) 


When  0 < a < 1,  functions  f (x)  [m  = 0,  1,  2,  3)  are  real.  Therefore, 

TTi 

fundamental  solutions  U,  (m)  and  U„(m)  arc-  found  by  decomposing  the  reef- 

1 C. 

ficients  into  real  and  imaginary  parts.  Thus 


ui<*)  =p„fJx'>  + Pi  fo^  + p? 


(^d.6) 


= a fjx)  + q f (x)  + <?5  j (x) 


(^d.7) 


P0  = (2^)"  'r(^)  ^ 


(fcd.8) 


(l-u*)  ~ r(^f)  r(^) 


(l*d.9) 


Pv  = + M 


P -1  + 2 _ ' ^+u  )u 

: ' 1-w  ) 2 F(2+u)  cos^^ 


i '■>  , 24  1 

IP  V ——VI  ) 


2 r(2+p)  Sin- 


( 3+u ) ■ 


5-  Fundamental  Solutions  for  a = 1 

When  <2  = 1,  (1.9)  reduces  to 

U ? 

d w 2 d w 

~ x ~1  + W = 0 

dx  ux 

Kevel  (1961)  gave  the  Fuchsian  type  solutions  of  this  equation  with  the 


notations , 


nev  (x)  = 

o 


(-IT  V 

h hin^-  rK> 


n ev  (x)  = 


” , U»  r(f)  r<£) 

g -7^77  h„7) 


nev^tx)  = 


E .(-i)rt  r(t;, 

K=0  h°'n(2n+l):  r (n+f) 


r<!>  ? 


nel  (x)  = ne\j(x)  logx- 


V (-1)"  r(^  r<^'  V ( 1 + i-  + -J—  ^ ->+1 

'/r  >*»(»!)*  r(W|)  r(n^>  £-  l**-1  * w+ir 

To  find,  the  fundamental  solutions,  the  unknown  function  w(x)  must 
he  transformed  to  the  unknown,  function  v(^)  defined  by  the  contour 
integral  (3.1)-  The  transformed  differential  equation  is 

(1  + c1*)  + 2c3y  = 0 

dC 

and  one  finds 

y(c)  = (i+?V1/2 

Therefore  the  complex  solution  w{x)  for  a = 1 is  found  in  the  integral 


(5.1) 


L \/l+? 


The  contour  L is  the  one  shown  in  Figure  1.  To  find  the  fundamental 
solutions  in  the  form  of  the  linear  combination  of  the  nev  functions, 

J.  Dieudonne  (1958),  as  explained  in  Bevel  (1968),  expanded  (5-1)  into 
power  series  in  the  neighborhood  of  x = 0,  and  determined  the  first  few’ 

coefficients.  The  fundamental  solutions  thus  found  are  denoted  here  by 

N J N 
W,  and  W : 

1 2 2 

( x ) = (t/2m)  1 r (^-)  nev^ix)  - m(2/2)  ~ nev^(x)  + {2^2t\)  1 ‘ (7)  Kev^ix) 


vl(x)  = (8/n)  1 r (^-)  nev(x)  - (t/m  )-1  r (r- 


(7)  nev^ix)  + nel-^x) 


- (l  - y + log  /2)  nev^ix) 
where  y is  Enler's  constant  0.5772156. 


Vie  shall  show  in  the  following  that  ^(x)  and  wAx) 
(Id. 7),  respecitvely , gives 


in  ( itd . 6 ) and 


1 in  w Ax)  = 

lJj(x)  + fk 

a-hL 

lim  w Ax'  = 

^(x)  - /k 

w2!'(x) 

a-*  1 

'o  show  this,  note 

that 

lira  p f ( x ) 
. o o 

c-+l 

= (2/2tT)-1 

F (-) 
V 

new  (x) 
o 

_2  , 

lira  9 friz) 
a-*  1 

= (2/2tt)_1 

nev^ix) 

lira  f]<+1(x'>  = nevAx) 

a+ 1 


We  shall  prove,  therefore,  that 

lira  /2U)  + P2  f3(xj  = - ^2  (1  - y + log/2  + (x)  + fenel^i 

(5.2) 

lira  ^1  /g ( x ) + <72  ~ (i  _ 1 + log^  - ^-)  ney-^x)  - /2  nel^{x) 


The  left-hand  side  of  (5 .2)  tecor.es 


,im  /Pl  /2(x)  + p2  f (r))  = XI  lira  ± (- 

^1  ' 4 ft  1 U^G  \ 


(5-3) 


Cx(n)  + C2(n| 


where 

+ f 

£*,(«)  = 2 f(2+u) 


r(l(2^')  Ffe(3^j  r(fojH') 

f(n+|(  2+h))  r(n4(3+f))  T (n+^(  5+t )) 


. ( 3+U  )ir 


faking  the  limit,  one  finds  that 


lim  — [ — 


: ":<f>  <I>  ] _1  I1 

n n } 


X 7T 

1 - Y - leg — + u + 

v2 


I I + 1_  + 1 

i5p+l  2p  Up- 1, 


where  we  have  introduced  the  convention  that  the  summation  "Yl  disappears 

F= 1 

when  n = 0.  This  equation  proves  (5-2). 

The  left-hand  side  of  (5-3)  becomes 


lim  L f (x)  + <?2  / (x)j  = -fe1;-  Ln+1  lim  7 K(n)  + 52(n) 

a->l  \ / ""  U n ! u-*o  ' 7 

n=0 


where  S ^ (n)  can  be  given  from  («)  by  replacing  cos  ^(3+u)t/^ 
with  sin  /( 3+b ) 7i/ly.  Taking  the  limit,  one  finds  that 
lim  ^ f-  ( n ) + S An  n 


r2  nl  (£)  (f) 

n n 


, - X IT  V"''/  1 1,1’ 

1 - Y “ loe^  ' t * ¥ JR 

i p*i 


where,  by  convention,  the  summation  ^ disappears  when  n = 0.  This 

P= 1 

proves  (5-3). 

6.  Fundamental  Solutions  for  c = 0. 

When  a = 0,  (1.5)  reduces  to 


- 1 * \ 2 

— + r — ) U + d = 0 

2 x ax  I 


as  may  be  derived  by  putting  //  = 0 ar.d  /Vgg  = 0 in  (1.2).  This  equation 


can  be  decomposed  in  two  equations. 


Id  .\ 

T + 7 S?  - 1 K = 0 


(6.1) 


! 


1 d__ 
x dx 


0 


(6.2) 


The  solutions  of  the  tvo  equations  satisfying  the  boundary  condition 
(1.6)  at  x = ™ are 


- kerx  + i keix 

and 


(6.3) 


Wg  = kerx  - i keix  (6.U) 

giving  the  fundamental  solutions  kerx  and  kerx.  Thus 


w = A kerx 

+ B keix 

(6.5) 

We  shall  prov 

e that  a'^(x) 

in  ( Ld .6)  and  w^(x) 

in  (Ua.7)  satisfy 

lim  U^(x)  = 

a-*o 

/2  kerx 

(6.6) 

lim  W2{x)  = 

a-±o 

keix 

(6.7) 

First  ve  note 

that 

lim  f ( x ) = 

p-FL  ° 

lim  f (x) 
p-*l 

= berx 

and 

lim  / («)  = lira  fp(x)  = h beix 

M-+1  p-KL 

Letting 

A = 1 - u 

Ve  transform  (ltd.  6)  to 


IT 


3Jr 

O -3  > 


lim  v = beix  + /2(log2-a)  berx  + 12  1^(3^  - 3X 

X-+o  1 2 f~2  X_K5 

Substituting  from  (2.1+)  and  (2.6),  one  finds  that 


E 


3 fQ  3f  3 

lim  ( tt  - rr1)  = - logx  berx  + 

X->o  3X  ~ L 


which  proves  (6.6). 

We  transform  (i+d-7)  to 


(-1)"  X1"- 


E 


1 

p 


- ^kr 2 2 n^]-  * sl"r  ^ * 


4 


i^iiT ( 2-X ) 


r,2+X.  T/1+-X,  J. 

{~I T}  lT),Ji 


l-X 

2 


(l-X ) ( 2-X ) 


r(i+x) 


Xtt 


f 

(* 

1 


Letting  X-+o, 


lim  u 


2 


TT 

2/T 


3 A 3 ^2, 


terx  - /2(  1-7+106  + 2)  beix  + lim  (— - gx) 


X-+o  r'  2/  2 2t^  x"+° 

Substituting  from  (2.5)  and  (2.6),  one  finds  that 


3/-,  3j?  (_i)7 

- sr > " ,‘loeI  beix  ' h Z.  jgSJF 

x-o  „=1  4 


l+n+2 


2n+l ) l) 


2«+l 

E 

P-2 


1_ 

P 


which  proves  (6.7)- 

6a.  Eigenvalues  for  a = 0. 

When  a - 0,  no  horizontal  pressure  works  on  the  plate,  and  buckling 
should  not  take  place  under  any  boundary  conditions.  V,e  shall  prove  celow 
that  this  is  true  under  the  boundary  conditions  (1.7),  (i.t),  and 

(1.9). 

The  following  formulas  are  needed  for  the  proof.  Substituting  either 
(6.3)  into  (6.1)  or  (6.U)  into  (6.2),  one  finds  the  relations. 


0 


( 6a . 1 ) 


ker"x  + x 1 ker'x  + keix  = 

and 

kei"x  + x 1 kei'x  - kerx  =0  (6a.?) 

We  shall  prove  that  no  positive  number  xq  can  satisfy  the  clamped- 
edge  condition  ( 1 . T ) - The  determinant  of  (1.7)  is  given  by 

kerx  ker'x 
keix  kei'x 

when  (6-5)  is  used.  Differentiating  D , one  finds  the  differential 

equation 

dD, 

1 ^ -1  n ,2  .2 

3 — + x D,  = ker  x + kei  x 
dx  1 

Solving  this  equation  under  the  boundary  condition  that  = 0 at 
x = one  finds 

oo 

°1  = ~ x / ?(ker2£  + ker2C)  d? 

x 

which  is  negative  for  any  positive  x,  proving  our  contention. 

We  shall  prove  that  no  positive  number  xq  can  satisfy  the  simple- 
edge  condition  (1.8)-  The  determinant  of  (l.8)  transforms  to 

kerx  (l-v)x  ^ ker'x  + keix 

keix  (l-v)x-1  kei'x  - kerx 

This  equation  transforms  to 

on 

£>2  = J"  ^(ker^E;  + kei2£)  d£  - (ker2x  + kei2x), 

x 

which  is  negative  for  any  positive  x,  proving  our  contention. 

We  shall  prove  that  no  positive  number  x can  satisfy  the  free-edge 
condition  (l .9)-  The  determinant  of  (1.9)  transforms  to 
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(l-v)x  1 ker'x  + keix  - kei'x 

(l-v)x-1  kei'x  - kerx  ker'x 


This  equation  transforms;  to 


= -i—  (ker'2x  + kei'2x)  + x J"  ^{ker2^  + kei2^)  df. 


which  is  positive  for  any  positive  x,  proving  our  contention. 

7-  Fundamental  f.olur. ions  for  a > 1. 

For  q > 1,  d defined  in  (2.1)  must  be  replaced  with  u = ik,  where 


= ^IT 


(7-1) 


To  compute  f(x  + iy ) , we  use  the  formulas 


r(x+iy)/T(x) 


n 1 + '//(x+nf 


(7.2) 


CO 

Arg  T(x+iy)  = yty(x)  + £ y/(x+n ) - tan  ~[y/(x+n)\ 

rs  ' 


(7.3) 


[Handbook  (ref.  5),  p.  256].  These  formulas  can  be  proved  by  use  of 
Euler's  formula  for  the  Gamna  function  (ref.  (9),  P-  237). 

Using  these  formulas,  coefficients  of  F(x)  in  (Id. 5)  become 


Bo  = 2(27T)  2 r2(J}  n [1  + <2dp+l)"2] 


(7-1*) 


|(2iO  d r (|-)  n [i  + < (lp+3)-  ] 


(7.5) 


B ^ = +(a<)  x R exp(+  jp-  + log*^  + e0) 


where  R and  0 are  defined  by 


Y 


v 


r ( 2+i< ) = R exp(+i0) 

They  are  given  by 


R = n (n  + 2)  [( n+2 )2  + <2]  2 

n= 0 


and 


0 = k(1  - Y)  + ^2  ![k/(h+2)]  - tan"1  [ie/(n+2)]! 


n=  0 


Functions  /q(x)  and  /^(x)  are  real.  To  decompose  the  complex 
function  f j<+jl{^)  into  the  real  and  imaginary  parts,  the  denominator  of 
(2.3), 

( lm+l+|j ) ( lm-i+p ) ( Un ) ( Un+2\i ) 
is  transformed  to 

= 8n[2rc(l6n2  + U - 5 a)  + i<{32n2  _ a)] 

Thus  one  finds 


fk+lix)  ~ S (-1)V0)  exp(?tp  ) xhn+^ 


n= 0 


where 


►t) 

0 ' 
0 

--  1 

Po 

= 0 

p(o) 

n 

= 8 (nl ) 

n 

n 

,p= 1 

P {Up2  + 1 _ ^n)2  + (a-  1 )(Up2  - ^ )2‘ 


B J 0 / 

1 

I 1 

w 

pn  £ tan~  \<{hp  " fy 

p{Up2  + 1 _ |a) 

P=i  L 

for  n _>  1. 

Fundamental  solutions  U^(x),  Ug(x),  and  their  derivatives  are 
found  by  decomposing  F{x)  and  its  derivatives  into  real  and  imaginary 
parts.  We  formulated  them  (up  to  the  third  derivatives)  as  follows: 
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h|oj 


PART  II.  ASYMPTOTIC  EXPANSIONS 


Values  of  a series  solution  developed  in  PART  I must  overlap  on 
a certain  range  of  x with  the  values  of.  an  asymptotic  expansion  determined 
corresponding  to  the  respective  series  solution.  The  series  may  be  used 
for  any  x less  than  the  overlapping  range,  and  the  asymptotic  expansion 


may  be  used  for  any  x larger  than  the  overlapping  range. 

8.  Asymptotic  Expansion  for  0 < a < 1. 

Using  analytical  continuation  of  the  hypergeometric  function  in 


(3-8)  from  the  range  1 < r < ® into  the  neighborhood  of  r = 1 (more 

I4 

exactly  in  the  range  |l-r  | < l),  one  finds  that  v^(r)  in  the  contour 
integral  solution  (3.1l)  defined  in  the  range  1 < r < °°  is  analytically 

continued  to 


vk(r)  = yjk)  F^(2+p),  J(2-u);  J ; 1-r^  + 


r2(r  -1)  2 F(r-(2+ii),  £(2-jj);  k 1 (8-D 


defined  in  the  neighborhood  of  r = 1,  where 


Double  signs  may  not  appear  in  the  nypergeometric  functions  on  the  right 


hand  side  of  (8.1)  because  of  their  symmetric  properties  with  regard  to 


Letting  r = 1 + t and  developing  the  hypergeometric  functions  on 


the  right-hand  side  of  (8.1)  into  power  series,  one  can  integrate  (8.l) 


to  a complex-form  asymptotic  expansion  for  0<a  < 1 


F(x)  - 


where 


P,  = -(1  + 2a) /h 

M 

P2  = (9+20 a + ha2)/g6 

B^  = - (3  + a)/ 6 

B2  = -(3  + a)(5  + a)/ 120 

Asymptotic  expansions  for  u^{x)  and  w2[x)  are  given  by  the  real  and 
imaginary  parts  of  (8.h),  respectively. 

9 • Asymptotic  Expansion  for  1 < a < 2, 

A form  of  asymptotic  expansion  for  a > 1 is  found  by  letting 
u = i<  in  the  coefficients  of  v^1'1  + (k  = 1,2).  In  this  case 

formulas  (7-2)  and  (7-3)  need  to  be  modified  to  include  the  case 
x = 0.  The  modified  formulas  are: 

oo 

|r(iv)|2  = y~2  II  [1  + (y/n)2]'1  (9.1) 

*=1 

and 

390 


Arg  r (iy)  = - f sig n(y)  - yy  + 2^  ~ tan_1  *)  ’ (9.2) 


where 


sig n(y)  = 1 


for  y > o 
for  y < o 


The  result  of  the  transformation  becomes  extremely  simple: 
y^1^  + y^2^  = cos { <log2) 


(9-3) 


y^  + y^2/  = k sin[ic(-rY  + logv^)) 


(9.«0 


Substituting  these  into  (8.k)  the  complex  form  asymptotic  expansion  for 
a > 1 is  found. 

Our  numerical  computation  shows  that  this  asymptotic  expansion 
is  effective  only  for  a close  to  1.  We  used  this  formula  for 
2 > a > l. 

10.  Asymtotic  expansion  for  a ^ 2. 

Letting  p = i<,  the  integral  solution  (3-11)  transforms  to 


| F(x) 


e^1  cos  ^log  (r2  + v'r  -l)  (r^-l)  2 dx  (10. 1) 


Expanding  the  integrand  in  the  neighborhood  of  r - 1 by  letting 
r = 1 + t,  and  using  the  approximations, 

log(r2  + /F- 1)  = 2 ft  + 0 (t3/2) 


/r  -1  = 2 ft  + 0(t) 

one  finds  the  integral  asymptotic  solution. 


F(x)~ 


Bx(l+t)  „ / d t 

e cos(</t)  — 

ft 


(10.2) 


To  evaluate  this  integral,  define  the  function, 


<*>  = !/ 


^xt+i<ft  dt 
e — 

ft 


Then  (10.2)  becomes 


F(x)~ 


(v* 


) + G (i 


(10.3) 


(10. U) 


2 

Letting  t - £ , (10.3)  transforms  to 

°o  ^ 

G)<(x)  = f exp[6x(v,:F  S)2  + fe] 


Define  3 by 

bx  u;§)2  = -,2 

The  root  z of  this  equation  satisfying  the  condition  that  the  real  part 
of  z must  approach  positive  infinity  as  £ -*•  <*>  is 

I 1 

z = exp(-  ^)  x2  £ + exp(-^p)  2 

Use  of  z thus  defined  transforms  (10-3)  to 


GAx)  = x~  2 


ki»Bx  r 8 


i-/ 


°°exp(  51^/8) 


exp (-3  ) ds 


where 


— 5 1 — 

= +exp(gir£ ) -gcx  2 

Transforming  the  contour  of  integration  to  the  sum  of  two  contours, 

a.^o  and  0'''+°°,  one  finds 

k 

G1(x)  + Gg(x)  = fvjx  exp^K2/(86x)  + it i/8^  (k 

Thus  one  gets 


♦>1 


f 


F(x)  — /n/x  exp(frr  + k2/(8&t)  + iti/8) 


(10.6) 


We  use  this  equation  for  a >_  2. 

11.  Fundamental  Solutions  for  Large  a and  Small  x . 


Our  numerical  computation  shows  that  the  overlapping  range  of  the 
series  solution  and  the  asymptotic  expansion  moves  to  small  values  of 
x as  the  values  of  a increases.  When  a = 2,  the  series  solution  and  the 
asymptotic  expansion  overlap  in  the  neighborhood  of  x = 6.  When  a = 6, 
they  overlap  in  the  neighborhood  of  x = 1,  showing  that  the  fundamental 
solutions  at  this  value  of  a is  ineffective.  For  larger  a,  fundamental 
solutions  must  be  transformed  to  a more  effective  form. 

Following  formulas  were  used  for  the  transformation.  For  large 


values  of  y 

1 1 . 1 ..  1. 

T(x+iy)  — /2tT  e ^ (^-)  e 


(11.1) 


| ] -1  + |wt i 

T{x+iy)  If  [n+x+iy  )|  e y n 


(11.2) 


where  x and  y are  real.  These  formulas  can  be  derived  by  transforming 
the  asymptotic  expansions  of  the  Gamma-functions  by  using  the  assumption 


that  y is  large. 

When  x is  small,  the  number  of  terms  needed  for  the  summation  of 
series  fm(x)  (m  = 0,  1,  2,  3)  in  (2.10  'v  (2.6)  are  fairly  small. 
Letting  k be  large  under  this  condition,  formulas  (ll.l)  and  (11.2)  may 


be  applied  to  transform  series  f [x) . Thus  one  finds 

m 

fQ(x ) — cos[(2k)-1x2] 


111. 3) 


f^x)  ■ — 2k  sin[(2<)  ] 


f,  (x)  x1-^*  exp[+i(^K^)  1xl4] 


(11-5) 


Also  one  finds 


r^r-d+iiejj  r^(l-tK)j  ~ W K 
r^(3+i<)^  r(i(3-i<))  ~ IT  K2 


1 1 

— - TK1T 

2 U 

e 


O ~ f** 
2 h 

it  k e 


11  • 3 • 

- - -tktt  +IK  + ^ 

f(2+i<)  ~ (27nc  ) e (</e)  e 


(11.6) 


(11. T) 


(11.8) 


Thus  for  extremely  large  k and  small  x,  one  finds  the  complex  expression. 


F(x)  - A(1+H2k)~1x2)  + Bx1+1,IC  + Cx1'™, 


(11-9) 


where 


1 1 
"2  “ VC1T 

A = (2tt/k)  e 

1.  13  1 , • 

B ,0  ,2  - 2 ~ 

B = ^ 2 ( 2tt  ) k e < 

1 • 1 3 3 • 

C = - 2 2 (2tt)2  k 2 e k 


(11. 10) 


(11.11) 


(11.12) 


PART  III  EIGENVALUES 


12.  Computation  of  Xq 

Our  numerical  computation  shows  that  the  clamped-edge  condition 
(1.7)  and  the  simple-edge  condition  (1.8)  do  not  yield  any  positive 
number  xq  as  a root  of  the  respective  determinent  equations.  The  free- 
edge  condition  (1.9)  always  yields  roots  or  a root.  We  shall  discuss 
below  only  the  free-edge  condition. 

Define  operators 


- + 2L 
2 x dm 


(12.1) 


dL  + Idl 

dm3  x dm2 


1 -a  d_ 

2 dm 


(12.2) 


Then  the  determinant  D found  by  substituting  (l.lU)  into  (1.9)  is  given  by 


|L(w  ) M(w  )| 

Do  = (12.3) 

|L(w„)  M(wJ 

< . c. 

Root  xq  thus  found  in  the  range  0 < a 4 2 are  shown  in  Figure  2 


and  3. 

To  discuss  the  neighborhood  of  xq  = 0 in  these  figures,  take  the 
first  term  of  the  series  /^(x)  (m  = 0,  1,  2,  3),  and  approximate  u^(x) 

and  w^x)  in  (l*d.6)  and  (l*d.7)  with 

wx(x)  = P0  + P1x1+W  + P2x1-y  + 0 (x2)  (12.1*) 


u2(x)  = qoX2  + q±x1+V  + q^1^  + 0(x5_W)  (12.5) 

Because  M(x1-y)  = 0,  Mfw^)  is  negligible  against  M(«2).  Therefore  the 
root  of  (12.3)  is  given  by  L(u^)  = 0,  which  yields 


This  equation  shows  that  the  condition  u <_  v,  i.e.  a >_  - v , must  be  met. 


When  n = u,  x o becomes  equal  to  zero.  Each  curve  in  Figure  2,  therefore, 

terminates  at  the  intersection  with  the  axis  of  abscissa,  whose  coordinate 
2 

is  a = 1 - v . 

For  small  q,  (12.6)  becomes 


where  1,(3)  is  Rieman's  Zeta-function.  Our  numerical  computation  shows 
that  (12.7)  gives  close  approximation  over  the  entire  lengths  of  the 
curves  in  the  neighborhood  of  a = 1-0  in  Figure  3. 

To  discuss  the  neighborhood  of  = 0 for  the  case  1 < a < 2,  we 
used  the  complex  form  F(m)  in  (Ud.5)  with  coefficients  given  by  (7-M  — 
(7-6).  Taking  the  first  terms  of  f (x) , one  finds  that 

M(F)  = 2 B2  (l+<2)r_1 

Because  B ^ is  a pure  inaginaiy,  the  real  part  of  M(F),  i.e.  M(u  ),  is 
negligible  against  the  imaginary  p-rt  of  M(F),  i.e.  Mfu^).  Therefore 


= 0 is  equivalent  to  L(w^)  = 0.  Equating  the  real  part  of  L(F) 
equal  to  zero,  one  finds  that  xq  is  approximated  by  the  root  of 
tan(a+Klnx)  = 

[(v-k2)  exp(jpcTr)  - <(l+v)  exp(-^KTr)]  • [<(l+v)  exp(-jTcir)  + (v-k2)  exp( --jociT ) ] 

(12.8) 


where 


For  small  k (12.8)  reduces  to 


vH(x)  + 1 + v 


vit 

r 


i + 


(l+v)n  vnc 


where 


8 


96 


, ( l+v)ll  vn 

1 " U 32 


H(x) 


(12.10) 


H(x)  = lof|(/2)  aj  - 1 + Y (12. ll) 
Our  numerical  computation  shows  that  this  equation  gives  close  approxi- 
mation over  the  entire  lengths  of  the  curves  in  the  neighborhood  of 


a - 1 + 0 in  Figure  3. 

Equation  (12.8)  shows  that,  if  x is  a root,  then  x ^ given  by 
x^+1  = x^  exp(-w/iO  (12.12) 

is  also  a root.  Therefore  infinitely  many  roots  exist  in  the  neighbor- 
hood of  x = 0.  Roots  xq,  x1,  x2  and  x^  are  shown  in  Figure  U where 
k = /a-1  is  used  as  the  ordinate.  The  solid  line  covers  the  values  we 
actually  computed-  They  may  be  extended  to  the  left  of  the  solid  lines 
by  means  of  (12.10)  and  (12.12). 

The  asymptotic  behavior  of  the  large  roots  can  be  found  by  using 
F(x)  in  (10.6)  to  compute  L(F)  and  M(F).  Assuming  that  C defined  by 

C = «W)  (12.13) 

is  of  the  ordinary  magnitude  for  large  x,  one  finds 

L(F)  ~ F(x)  (B2  - 2C  + B_2C2)  (12- 11*) 

and 

M(F)  — F(x)  (1 +C2)  [exp(^)  - c exp(-  *£)  ] (12.15) 

Thus  one  discovers  that  there  are  two  asymptotic  roots. 
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xh  = iuchr  2 < 


(12.16) 

n • '«' 

where 

= 2 + ^ (12.17) 

In  these  two  equations,  suffix  h in  defined  by 

h = k - 1,  (12.18) 

where  the  old  convention  for  suffix  k is  still  observed.  The  two  lines 
in  Figure  U expressing  the  two  equations  in  (12.16)  are  shown  by  the  broken 
lines. 

Asymptotic  roots  were  also  computed  retaining  all  the  terms  in 
L(F)  and  M(F)  that  were  found  by  letting  F(m)  be  (10.6).  Carrying  out 
the  computation  of  D ^ as  given  by  (12.3),  one  finds  that  the  equation 
- 0 reduces  to 


n=0 

where 

N 


N x~n  = 0 

n 


= (i+0(i+c2)(i-kn2) 

= >'/2  [(1-v)  + k -2C2  + 8^  - (3-v)^] 


(12.19) 


(12.20) 


- - i +5(2-v)c  - (6-vk2  + (i2-7vk3 


N_ 


= [-( 3+v)  + 2(9-ll*v)c  - (15-I3vk2) 


Ny 


Yg  [(-3+8v)  +3  (15-I6v)d 

The  positive  roots  of  N are  £ and  C,  in  (12.17).  The  solid  lines 

o o l 

running  close  to  the  broken  lines  in  Figure  U cover  the  values  of 
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xq  and  computed  for  v = 0. 5 by  use  of  (12.19)*  The  values  of  xq  and 

in  the  range  a 4 2 were  computed  "by  using  the  series  (7.7) 
and  (7.8). 

The  asymptotic  behavior  of  the  small  roots  can  he  found  by  using 
(ll.9)  to  compute  L(F)  and  M(F).  One  finds  that 

K(F)  = i A (tcx)-1  (l+<2)  (12.21) 

Because  A is  real,  K(w^)  is  negligible  against  M(zu2)>  and  = 0 is 
equivalent  to  L(u^)  = 0.  For  large  k,  this  yields 

x ~ >Fk  e 

which,  however,  is  not  small.  Therefore  small  roots  do  not  accumulate  at 
point  x = 0,  when  k is  large. 

This  conclusion  does  not  yet  exclude  the  possible  existence  of 
roots  that  are  too  small  to  be  found  with  the  asymptotic  expansion  (10.6) 
but  too  large  to  be  found  with  the  approx imat ion  (11.9).  It  is  probably 
true,  however,  that  roots  x n ( n 2)  become  equal  to  zero  at  certain 
values  of  k and  do  not  extend  indefinitely  to  large  values  of  the 
ordinate. 

Extension  of  the  curves  expressing  x n (n  > 2)  beyond  the  ordinate 
■e  > /3  was  not  attempted.  Our  interest  was  originally  in  small  a, 
and  moreover  we  did  not  have  enough  time  to  have  series  improve 
for  the  case  a > 1.  However,  we  believe  that  small  roots  are  not 
important  for  engineering  purposes  and  need  not  be  known  in  detail. 

13.  Deformation 

Forms  of  deformation  corresponding  to  the  roots  x were  calculated  in 
2 

the  range  1-v  ^ a 4 2 by  assuming  the  normalization, 


< 
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1 


(13.1) 


= 

Two  cases,  {a  = 1,  v = 0.3)  and  (a  = h,  v = 0-3),  are  shown  in  Figure 
5 and  6,  respectively. 

These  forms  of  deformation  have  often  been  observed  in  laboratories 

and  fields  when  floating  ice  plates  are  compressed.  We  are  now  convinced 

that  buckling  is  frequently  taking  place. 

Forms  of  deformation  other  than  shown  in  Figure  5 and  6 can  be  guessed 

by  use  of  Figure  7 and  8,  where  the  values  of  U ■ (minimum  depression) 

mm 

and  x . (defined  by  w . = w(x  . ))  determined  for  x are  shown.  (See 

mm  mm  mm  o 

Figure  5 For  the  definition  of  x . on  a curve  of  deformation).  Values 

mm 

2 

a in  these  figures  are  restricted  to  1-v  <_  a <_  2.  We  did  not  compute 


4 

-.  i 

f'  1 

' ! 


* ’ 


U 

n 


V 

« . • 

*•  • 

f 


* 

i* 


.) 


them  for  the  case  a > 2,  nor  for  (n  >.  l)  except  for  the  cases  shown 
in  Figure  6.  The  broken  lines  in  these  figures  are  determined  by  the 
terminal  condition  v = u. 

The  deformation  at  fracture  shall  be  determined  by  assuming  that 

the  stress  at  x . reaches  the  fracture  stress  o ~ . In  the  general 
mm  f b 

polar  coordinates,  stress  components  o , o„. , a - are  given  (see 

W DO  T D 

Appendix  C)  by 


and 
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vhere  h is  the  thickness  of  the  plate.  In  our  case  of  axi symmetry , 
introducing  the  nondimensional  length  x defined  by  (l.M,  the  above 
formulas  become 


E h d i)  v d u 

2(l-v)l  2 dx2  x dx 

o 


Ih  1 du  , d?w 

I—  + V 

,,  i.  2 i dx  ,2 

(l-v)fc  I ax 


(13-3) 


a „ = 0 

r6 


At  point  xm£n>  where  du/dx  = 0,  therefore. 


|°rr|  > | a6  8 | 


Let  U„(x)  be  the  normalization  of  w(x)  at  x = x . Then  the  de- 
N o 


pression  is  given  by 
u(x)  = K u^(x) 

where  K shall  be  determined  by  applying  the  condition  that 


(13- M 


o = o„  at  x = x . 

1 rr 1 f rcnn 


(13-5) 


where  cy  is  the  fracture  strength.  Summing  up  the  above  results, 
K is  found: 


K = 2 12  a . (Eli)-1  H(v,a) 


° I 


where 


H( v,a)  = (1-v2)/ 

\dx  t 


(13.6) 


( 13 - T ) 


Values  of  H(v,a)  are  shown  in  Figure  9 for  the  case  l-v^  < a < 2. 


405 


ACKNOWLEDGEMENT 

Professor  Jean  Dieudonne  (Iiaitre  de  Conference  a la  Faculte  des 
Science  de  Hice)  solved  the  boundary  value  problem  of  (5-1)  during  his 
visit  at  Northwestern  University  in  1958  at  the  request  of  the  U.S.  Ar: 
Snow,  Ice  and  Permafrost  Research  Establishment,  then  at  Evanston,  111 
as  recorded  in  references  (5)  and  (9)*  His  method  inspired  the  author 
for  developing  the  method  of  transforming  the  integral  solution  (1.12) 
to  the  linear  combination  (1.13). 

Numerical  computation  was  performed  with  the  help  of  Duncan  M. 
Fitchet,  student  at  Dartmouth  College. 


REFERENCES 


■ 


(l)  Zabilansky,  L.J. , D.E.  Nevel,  and  F.D.  Haynes:  Ice  forces  on 

model  structures.  Canadian  Journal  of  Civil  Engineering,  2(1975), 
UOO  - Ul7. 


(2)  Mansfield,  E.H.  (196*0  The  Eending  and  Stretching  of  Plates. 

The  MacMillan  Company,  New  York. 

(3)  Retenyi,  M.  (19^6)  Beams  on  Elastic  Foundations.  The  University 
of  Michigan  (Ann  Arbor)  Press. 


(U)  Ince,  E.L.  (1972)  Ordinary  Differential  Equations,  Republished  by 
Dover  Publications,  Incorporated. 

(5)  Handbook  of  Mathematical  Functions.  Rational  Bureau  of  Standards 
(196B). 


(6)  Nevel,  D.  (1961)  The  narrow  free  infinite  wedges  on  an  elastic 
foundation.  U.S.  Army  Cold  Regions  Research  and  Engineering 
Laboratory  Research  Report  79- 

(7)  Dieudonne,  J.  (1958)  Study  of  the  equation  y""  + 2xy""  + w = 0. 
Unpublished  manuscript. 

(8)  Nevel,  D.  (1968)  The  general  solution  of  a wedge  on  an  elastic 
foundation.  U.S.  Army  Cold  Regions  Research  and  Engineering 
Laboratory  Research  Report  2^7 • 

(9)  Whittaker,  E.T.  and  G.N.  VJatson  (1952)  A course  of  Modern 
Analysis.  Cambridge  at  the  University  Press. 


s'  1 


i 

.. 


n 


V 


r 


* 


* 


408 


APPENDICES 

In  the  following  Appendices  A,  B,  and  C,  transformation  of  tensor 
components  utilized  in  this  paper  are  derived  ty  use  of  the  tensor  nota- 
tion where  tensors  are  expressed  in  combinations  of  components  and  base 
vectors.  This  tensor  expression  yields  simpler  and  more  enjoyable 
analysis  of  component  transformations  in  Euclidean  space  than  the  con- 
ventional tensorial  expressions  where  base  vectors  are  omitted,  because 
geometric  and  mechanical  quantities  are  explicitly  shown  in  the  former 
and  therefore  the  meaning  of  the  step  by  step  computation  is  clear. 

In  the  Appendix  D,  the  deformation  for  the  case  a = °°  is  derived. 

In  the  Appendix  E,  the  buckling  of  the  semi-infinite  plate  is  discussed. 
It  is  interesting  to  note  that  both  cases  pertains  to  the  case  of 
= °°  but  they  are  substantially  different. 

A.  Transformation  of  (1.10)  to  (l.ll). 

Shears  Q and  Q in  rectangular  coordinates  are  the  magnitude  per 
x y 

unit  length  of  the  shears  acting  on  a side  normal  to  the  x-axis  and  y- 

axis,  respectively,  (see  Figure  10).  We  shall  begin  with  expressing 

and  Q as  components  of  a vector.  Let  c and  c be  unit  vectors  in 
xy  x y 

the  x-  and  y-directions . In  Figure  11,  let  be  a unit  vector  normal 

to  the  hypotenuse  AB  (Fig.  ll).  Vector  c is  given  by 

c &s=c  da+cdx  ( A.  1 ) 

n x * y 

because  c thus  defined  satisfies  the  condition 
n 

c • c = du/ds 

n x s 

and 

c • c = dx/ds 

n y 
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where  the  dot  ( • ) between  two  vectors  means  the  scalar  product  of  the 


two  vectors.  Let  Q be  the  shear  per  unit  length  of  the  hypotenuse 
AB.  It  is  given  by 

Q ds  = Q dy  + Q dx  ( A . 2 ) 

s x ^ y 

We  can  now  prove  that  the  equation 

Q = Q c + Q c (A. 3) 

x x y y 

is  the  desired  vector  combination  of  Q and  Q , because  the  relation 

x y 


M 


is  satisfied. 

Substitute  (1.10)  into  (A. 3)  and  transform  the  result  to  a tensor- 
invariant  form: 

Q = V*  M + Vw  • N (A.M 


where 


1 

n 3 3 

V = C — + C — 

X oX  y o tj 

(A. 5) 

4 

M = M c c + M (c  c + c c ) + M c c 
xx  x x xy'  x y y x yy  y y 

(A. 6) 

and 


oU 
r 

32* 

3u, 

3r* 

3u 

3? 

3u„ 


ot)  V 


0 


r* 


= 0 


= 0 


u„ 


In  the  polar  coordinates,  (A. 5)  becomes 


J 


v = 

u 

V 

3 

3r 

+ Us'r30 

Q = 

3_ 

3r 

3 \ m / 3w 

du  \ 

+ u 1 

e 3 ej 

where 

M 

M u u 
rr  v v 

+ We  + ueur>  + 

M u u 
66  0 0 

j .N, 


(A. 9) 


(A. 10) 
(A. 11) 

(A. 12) 


and 


N = 


N u u + A'  (u  ii  + u.u  ) + N uQu„ 
rr  r-  r rd  r 6 6 r 60  0 0 


(A. 13) 


, ■ In  the  polar  coordinates,  (A3)  becomes 

q = eyur  + eQu6  (a.iM 

Carry  out  the  differentiation  in  (A.ll)  by  use  of  (A. 9)  and  the  scalar 

L ^ 1 

products  indicated  by  dot  ( • ) and  identify  the  components  with  those 
| ? of  (A. lb),  then  one  finds  (l.ll). 

| > ; B . Transformation  cf  (l.l)  to  (1.2) 


We  shall  prove  the  formula  in  the  general  polar  coordinates. 


The  left-hand  side  of  (B.l)  is  a tensor-invar ient  form  N ••  W w 
where  N is  given  by  (A.?),  and  77  U is  a dyadic. 
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The  double  dot  (••)  of  ab'^cd  means  (b.cHa.d).  In  polar  coordinates, 

/ 9 9 \ / 9 \ 

= [Ur  9?  + U6  r30/  [Uy  9?  + U0  Fa?) 

Carrying  out  the  differentiation  given  by  (A. 9),  one  finds 


2 

77U  = — — u u + — (—  — ) (u  u„  + u„u  ) 
YV  . 2 r r 9r  r 90  r 6 0 r 

9r 

2 

/I  9 U 1 9tJ , 

2 2 r 9r  U0U0 

2°  cu 


(B.2) 


Carrying  out  the  double  dot  products  by  use  of  N in  (AT)  and  77  u in 
(B.2),  one  finds  that  N ”Wu  becomes  the  right-hand  side  of  (B.l). 
C.  Proof  of  (13-2) 

Substituting  Equation  ( 1 . 1* ) of  Mansfield  (l),  one  can  transform 
the  tensor  equation 

d = OCC  + OCC  + T (cc  + cc) 

xxx  y y y xy  x y y x 


to  an  tensor-invariant  form 
o = 

The  tensor-invariant  operator 


[£Z/(1-V2)]  (yvj 


(C.l) 


*99 

A = c c t— 

x ay  y ox 


becomes 


A = u 


( C . 2) 


> r39  0 9r 

in  the  polar  coordinates. 

Substituting  (A. 10)  and  (C.2),  and  carrying  out  the  differentia- 
tion as  given  by  (A. 9),  one  finds  that  (C.l)  becomes 
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a 


ratio 

A l(u2)  Vi[w2) 

B L(u1)  = " 

can  be  computed  by  using  (12.1U)  and  (12.15)-  Thus  one  finds 

A (l+C^)  cosIh  + (1_^)  sinIh 

B (l+C^)  sinIh  “ cosIh 

where 


Thus  the  normalized  deformation  U„(r)  is  given  "by 


1 -1 

u/;(r)  = R{x)/R.  ccs(Kx)  - Ift)  + (/3)  Sin  (J(x)  - Ifc) 


Letting 


X " ^ = 5 


and  assuming  £ to  he  finite,  one  may  let  r-*»  in  (D.l).  Thus  one  finds 

- 1 

u^(r)  = exp(-£/^2)  cos  (£//2)  + (/3)  sin(£//2)  (D.2) 

The  maximum  of  u^(r)  occurs  at 
tan(£//2)  = - 2 + /3 

vhich  is  negative.  Therefore  W^(x)  is  always  decreasing  for  £ 0. 

The  deformation  at  a = therefore,  does  not  take  a minimum,  as  those 

2 

(shown  in  Figure  5 and  6)  of  case  1-v  < a < 2 do. 

E.  Buckling  of  semi- infinite  plate 

We  shall  show  that  the  deformation  discovered  in  the  preceding 
section  is  different  from  the  buckling  deformation  of  a rectangular 
semi-infinite  floating  plate. 

We  assume  that  uniform  oressure  N is  anplied  on  the  axis  y,  the 

xx 


axis  x extending  from  x = 0 to  x - <*>.  Then  from  (l.l)  one  gets 


Ak 

d v 

— £ + yw  = ft 
dr 


(E.l) 


XX  , d 

or 


where  we  have  put  <7=0.  Defining  new  x by  the  quotient  of  old  x 
divided  by  the  characteristic  length  i = (D/ y)^  , (E.l)  becomes 

U 2 

d w _ d w i 

dr  dr 

where  we  have  put 


i* 


I 


p 


la 


I . 


f , . 


N = -2ayi 
XX  o 

Because  N < 0,  the  relation 
xx  *= 


(E.3) 


a >_  0 

must  be  satisfied.  The  solution  of  the  differential  equation  (E.2)  is 
= expU^x)  (E.U) 

where  

O f~ r\ 

(E.5) 


X = -a  + fa?  - 1 


The  real  part  of  X^  must  be  chosen  to  be  negative.  The  convention  with 
regard  to  suffix  k is  still  observed. 

When  a = 1,  the  general  solution  is  given  by 
w = A cos  x + B sinx 

which  we  do  not  accept,  because  the  boundary  condition  at  x = <=°  cannot 
be  met. 


When  0 4 a < 1,  letting 

a = cos  2n  (E.6) 

i 

; the  general  solution  is  given  by 

— ft'T* 

w = e [A  cosax  + B sinax]  (E.7) 

I 

where 


s 

* 

[ I 

g — sinn 

(E.8) 

Y 

and 

a = cosn 

(E.9) 

>; 

* 

The 

condition 

1 r ii 

0 < n < -n/h 

(E.10) 

: £ 

must 

be  met  to  satisfy  the  conditions  with  regard  to  a and  X^. 

V/ hen  a > 1,  letting 


I 


>: 


a = cos h 2ri 

one  finds  four  fundamental  solutions 

cosvx,  sinvx,  cos(x/v),  and  sin(x/v) 

where 

v = exp(rt) 

We  shall  discuss  below  only  the  case  0 i a < 1,  because  the  boundary 
condition  at  x = °>  cannot  be  satisfied  in  the  other  cases. 

The  free-edge  condition  for  this  case  is 


djj 

dx2 


and 


3 

^+2a^ 

dx3 


= 0 


} (E.ll) 


The  second  equation  of  (E.ll)  is  derived  from  the  first  equation  of 
(1.10).  Substituting  (E.7),  one  finds  that  the  eigenvalue  is  given  by 
n = ir/6 


i.e. 


a = 1/2 


(E.12) 


The  deformation  for  this  case  is 

w(x)  = A exp(-x/2)  cos[(/3n/2)  + ( tt/6 ) ] (E.13) 

where  A is  arbitrary.  The  maximum  of  w(x)  occurs  at 
x = W(3/3) 

Therefore  the  deformation  in  this  section  is  different  from  the  deformation 
in  the  preceding  section. 

For  the  simple-edge  condition,  the  eigenvalue  is  given  by 
a = 1 

which  we  do  not  accept. 
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Nonlinear  Theory  of  the  Response  of 
Pavements  to  Vibratory  Loads 


Richard  A.  Weiss 

Pavement  Investigations  Division 
Soils  and  Pavements  Laboratory 
U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi 

ABSTRACT.  A nonlinear  model  of  the  pavement  response  to  a dynamic 
load  is  presented  which  has  applications  to  the  vibratory  nondestructive 
method  of  testing  pavements.  The  parameters  of  the  model  have  been 
determined  by  comparison  with  actual  dynamic  load-deflection  curves.  The 
model  gives  a quantitative  description  of  the  dependence  of  the  measured 
dynamic  load-deflection  curves  on  the  strength  of  the  pavement,  static 
load  of  the  vibrator,  and  the  frequency  of  operation  of  the  vibrator. 

The  model  determines  the  elastic  modulus  of  the  subgrade  from  the  measured 
load-deflection  curves.  The  nonlinear  dynamical  model  is  applied  to  the 
laboratory  determination  of  the  resilient  modulus  with  the  result  that 
the  resilient  modulus  is  expressed  analytically  in  terms  of  the  static 
confining  pressure,  dynamic  deviator  stress,  and  material  parameters  which 
describe  the  linear  and  nonlinear  behavior  of  soil  under  dynamic  and  static 
force  loading. 

I . INTRODUCTION 

The  Waterways  Experiment  Station  (WES)  has  for  many  years  used  the 
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method  of  nondestructive  testing  of  airfield  pavements.  This  method  of 

testing  pavements  is  relatively  quick  accurate,  reproducible,  and  in- 
expensive. When  the  nondestructive  test  method  is  used  an  airfield 
runway  need  not  be  shut  down  for  long  periods  of  time  as  is  the  case  for 
the  destructive  testing  of  pavements. 

The  instrument  used  for  the  vibratory  nondestructive  testing  of  pave- 
ments is  a mechanical  vibrator  whose  force  payload  to  the  pavement  surface 
is  generated  either  by  a hydraulic  system  or  a mechanism  of  counter-rotating 
weights.  The  WES  16-kip  vibrator  applies  a static  load  of  16  kips  to  the 
pavement  surface  and  a dynamic  load  to  the  pavement  surface  which  can  be 
varied  from  0 to  15  kips.  Both  static  and  dynamic  loads  are  applied  to 
the  pavement  surface  through  a circular  18-in.  diameter  baseplate. 

Four  types  of  nondestructive  tests  are  generally  performed  on  pave- 
ments, and  these  consist  of  the  following  measurements: 

su  Dynamic  load-deflection  curves  giving  the  dynamic  amplitude 
as  a function  of  the  dynamic  load, 
b.  Frequency  response  spectrum  giving  the  dynamic  amplitude 
as  a function  of  frequency  for  a fixed  dynamic  load. 

£.  Deflection  basin  measurements. 

d..  Rayleigh  wave  dispersion  curves  giving  phase  velocity 
versus  wavelength. 

Only  the  dynamic  load-deflection  curves  and  the  frequency  response  spectrum 
measurements  will  be  considered  in  this  paper. 


A typical  measured  frequency  response  curve  appears  in  Fig.  1,  and 
a typical  measured  dynamic  load-deflection  curve  appears  in  Fig.  2.  Most 
of  the  WES  measurements  of  the  dynamic  load-deflection  curves  were  done  at 
a frequency  of  15  Hz.  Experience  has  shown  that  the  dynamic  load-deflection 
curves  are  relatively  smooth  for  this  frequency.  The  frequency  response 
spectrum  may  contain  multiple  resonance  peaks. 

Two  basic  theoretical  approaches  have  been  taken  to  describe  the  ex- 
perimental data: 

1.  a linear  theory  of  the  frequency  response  spectrum 

2.  a nonlinear  theory  of  the  dynamic  load-deflection  curves 

The  two  types  of  dynamic  pavement  response  models  that  have  been  considered 
are  shown  in  Fig.  3.  Single-mass  and  multiple-mass  models  have  been  devel- 
oped in  the  linear  theory,  while  only  a single  mass  model  was  developed 
with  a nonlinear  spring  constant.  It  was  found  that  multiple-mass  pavement 
response  models  are  somewhat  intractable  because  they  contain  many  para- 
meters. Only  the  single-mass  pavement  response  models  are  considered  in 
this  paper.  The  elements  of  the  spring-mass-dashpot  model  must  be  determined 
in  terms  of  the  characteristic  forms  of  the  measured  frequency  response 
spectrum  and  the  measured  dynamic  load-deflection  curves. 

II.  DYNAMIC  FREQUENCY  RESPONSE  THEORY 

The  dynamic  frequency  response  spectrum  measured  at  the  pavement  sur- 
face is  often  quite  complex  and  difficult  to  interpret.  Many  factors  prob- 
ably contribute  to  produce  its  characteristic  shape.  In  order  to  extract 
some  information  about  pavement  and  sub  :rade  structure  from  the  measured 
dynamic  frequency  response  spectrum  it  is  necessary  to  use  a simple 
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dynamic  pavement  response  model  to  fit  the  measured  frequency  response 
spectrum  with  the  theoretically  predicted  frequency  response  spectrum.  This 
fit  will  yield  the  parameters  of  the  dynamic  model  from  which  the  pavement  and 
subgrade  structure  can  be  determined.  The  frequency  response  spectrum  of 
the  single-mass  model  has  one  resonance  peak,  and  this  predicted  resonance 
peak  is  fit  to  the  second  resonance  peak  of  the  measured  response  spectrum. 

The  second  peak  is  chosen  because  an  examination  of  many  frequency  response 
spectra  has  shown  this  peak  to  be  more  consistent  and  less  affected  by 
electronic  equipment  than  the  other  peaks.  Generally  the  second  peak  is 
the  most  pronounced. 

The  second  resonance  peak  is  associated  with  a resonance  frequency  and 
a resonance  amplitude  as  indicated  in  Fig.  4.  The  resonance  amplitude  and 
frequency  was  used  to  calculate  the  elements  of  the  spring  model  — effective 
mass,  effective  spring  constant,  and  effective  damping  constant.  The  elements 
of  the  single-mass  spring  model  can  be  simply  related  to  the  resonance  peak. 
DETERMINATION  OF  ELEMENTS  OF  THE  SPRING  MODEL 

6"9 

Within  the  framework  of  the  single-mass  spring  model  the  dynamic 
amplitude  of  the  pavement  surface  response  to  a sinusoidal  dynamic  load 
can  be  written  as 


A = Fd/S  (1) 

S = / (k-moj2)Z  + C2o)2  (2) 

where  A = amplitude  of  the  dynamic  displacement  of  the  pavement  surface  as 
represented  by  a linear  spring  model,  Fd  = dynamic  load  applied  to  the  pave- 
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ment  surface,  S = dynamic  stiffness,  k = linear  spring  constant,  m = 
effective  mass  of  the  pavement- subgrade  system,  u * angular  frequency,  and 
C “ damping  coefficient.  The  resonance  frequency  and  amplitude^  can  be 
obtained  from  (1)  and  (2)  to  be 


■fu-  / 1 - 2D2 
2tt  V m 


(3) 


Ar 


2kD/  1 - D2 


(4) 


D = 


2/  km 


(5) 


where  fR  * resonance  frequency,  A^  = responance  amplitude,  and  D = damp- 
ing ratio.  The  three  elements  of  the  linear  spring  model  that  are  to  be 
obtained  are  k , m and  C . In  order  to  determine  these  three  parameters 
another  piece  of  information,  in  addition  to  fR  and  , is  necessary. 

This  is  given  by 


r 1 


J(m)  = A^/A 

where  J(w)  = ratio  of  the  resonance  amplitude  to  the  amplitude  at  some 
nearby  frequency.  The  theoretical  value  of  this  ratio  is  given  by 


(6) 


1 

•t 


, „ \ J (k  - mw2)2  + C2m2 

J (k  ,m,C ,w)  = . - '2-  - 

y (k  - mm2)  + c2u2 


(7) 
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The  three  measured  quantities  which  are  extracted  from  the  frequency  response 
curve  are  fR  , and  J(w). 

The  spring  model  elements  k , m and  C must  now  be  obtained  in  terms 
of  fR  , and  J(m).  The  equations  in  (3)  - (6)  can  be  inverted  to  deter- 
mine k and  D in  the  following  manner 


k *■  4ir2mf 2 1 / 1 + 
R 


Arr^f  2 


ft 


(8) 


„2  = I - I 

2 2 


1 + 


r2mfRAR/  J 


-1/2 


(9) 
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ii 

k 

n 
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The  k and  D terms  have  now  been  expressed  in  terms  of  the  effective  mass. 
Using  (8)  and  (9)  it  is  now  possible  to  express  J(k,m,C,w)  in  terms  of  the 
effective  mass  as  the  only  unknown  parameter  as  follows 


J(m,u>)  = 


/ 

L mu2 

| , 4mD2u2 

l_ 

1 " k 

1 + k 

.1 1 mm2 1 

4mD2oi2 

/ 

I - K 

, R 

r 

l1  k j 

1 k 

(10) 


The  only  unknown  independent  variable  in  J(m,m)  is  now  the  effective  mass. 
By  sweeping  through  a series  of  values  of  m and  calculating  numerical 
values  of  J(m,m)  it  is  possible  to  determine  the  specific  value  of  m for 

which  J(n,w)  is  equal  to  the  experimental  value  of  the  J-ratio,  i.e., 
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J(m,w)  = J(u>).  This  condition  determines  the  value  of  the  effective  mass 


required  by  the  spring-mass-dashpot  model  to  fit  the  experimentally  mea- 
sured dynamic  frequency  response  curve.  Placing  this  calculated  value  of 
the  effective  mass  into  (5),  (8)  and  (9)  gives  the  proper  values  of  k and 
C required  to  fit  the  experimental  frequency  response  data.  The  necessary 
computer  programs  to  accomplish  this  work  on  a digital  computer  have  been 
developed  and  will  be  referred  to  as  the  WES  Dynamic  Frequency  Response 
Program. 

DETERMINATION  OF  SUBGRADE  MODULUS  BY  FREQUENCY  RESPONSE  METHOD 


The  value  of  the  spring  constant  that  is  determined  from  the  measured 
frequency  response  spectrum  will  be  used  to  determine  the  subgrade  modulus. 
The  theory  of  the  linear  elastic  layered  half-space  predicts  a theoretical 
value  of  the  static  spring  constant  k^,  which  depends  on  the  radius  of 
the  loaded  area  and  on  the  elastic  constants  of  the  subgrade  and  the  pave- 
ment layers.  Computer  programs  are  available  which  calculate  the  value 
of  kT  if  the  Young's  modulus  and  Poisson's  ratio  of  each  layer  of  the 
half-space  is  known.  A well  known  computer  program  of  this  kind  is  the 
Chevron  Program.  The  procedure  for  determining  the  Young's  modulus  Eg 
of  the  subgrade  is  shown  in  Fig.  5.  The  measured  values  of  f^  , A^  and 
J(u)  are  inserted  into  the  WES  Dynamic  Frequency  Response  Program  and 
values  of  k , m and  C are  determined.  The  Young's  modulus  and  Poisson's 
ratio  of  the  layers  of  the  pavement  are  selected  and  entered  into  the 
Chevron  Program.  The  subgrade  modulus  Eg  is  then  iterated  in  the  Chevron 
Program  and  a series  of  values  of  kT  are  determined.  The  proper  value  of 
E_  is  determined  by  the  condition 


na 


W 


k = k„ 


The  predicted  value  of  Eg  will  depend  on  the  values  of  the  elastic  moduli 
selected  for  the  pavement  layers. 

NUMERICAL  RESULTS  OF  FREQUENCY  RESPONSE  METHOD 

Values  of  k , m , C and  Eg  have  been  obtained  for  several  airport 
pavement  sites  and  are  listed  in  Table  I.  This  table  has  listed  the  sites 
according  to  increasing  values  of  the  Dynamic  Stiffness  Modulus  (DSM) , which 
is  the  slope  of  the  dynamic  load-deflection  curves  at  a dynamic  load  of  14 
kips.  It  is  seen  that  the  measured  spring  constant  k increases  with  in- 
creasing pavement  strength  and  that  k is  not  equal  to  the  DSM  value.  The 
effective  mass  is  presented  as  a ratio  to  the  above-surface  (vibrator)  mass, 
and  increases  with  the  strength  of  the  pavement.  The  effective  mass  is  not 
equal  to  the  above-surface  mass  and  any  theory  which  apriori  assumes  that 
m = cannot  be  used  to  fit  the  experimental  frequency  response  data. 

The  value  of  the  damping  constant  also  increases  with  increasing  pavement 
strength.  The  predicted  values  of  Eg  are  compared  to  those  modulus  values 
that  are  predicted  by  the  CBR  method  (Eg  = 1500  CBR) . The  values  of  Eg 
predicted  by  the  combined  WES  Frequency  Response  Program  and  the  Chevron 
Program  are  3 to  5 times  larger  than  those  predicted  by  the  CBR  method. 

There  are  several  possible  reasons  for  the  discrepancy  in  the  values 
of  Eg  predicted  by  these  two  methods: 

_a.  the  pavement-subgrade  system  is  nonlinear  under  dynamic  and 
static  loading 


b.  the  subgrade  is  not  uniform  and  the  theoretical  layered 


elastic  half-space  model  requires  a rigid  boundary  below 
the  subgrade 

£.  reflections  from  a lower  boundary  layer  add  to  the  motion 
of  the  pavement  surface 

When  a rigid  boundary  such  as  bedrock  is  present  relatively  close  to  the 
pavement  surface  it  is  possible  that  the  effects  listed  in  _b.  and  c_.  may  be 
of  importance  for  determining  the  motion  of  a pavement  surface  that  is  sub- 
jected to  a sinusoidal  dynamic  loading.  However,  the  discrepancy  between 
the  values  of  Eg  predicted  by  the  CBR  method  and  that  predicted  by  the 
frequency  response  spectra  method  also  occurs  in  cases  where  the  subgrade 
is  relatively  uniform  and  contains  no  obvious  discontinuities.  Therefore 
only  the  fact  that  the  response  of  pavements  and  subgrades  to  dynamic  and 
static  loads  is  nonlinear  remains  as  a possible  explanation  for  the  dis- 
crepancy in  thj  values  of  E determined  by  these  two  methods. 

o 

III.  NONLINEAR  THEORY  OF  PAVEMENT  RESPONSE  TO  DYNAMIC  SURFACE  LOADINGS 
An  alternative  method  for  determining  the  subgrade  modulus  from 
vibratory  nondestructive  test  data  is  the  use  of  the  dynamic  load-deflection 
curves  measured  at  the  pavement  surface  for  a fixed  frequency  and  a fixed 
static  load.  These  dynamic  load-deflection  curves  are  generally  nonlinear 
for  weak  pavements  and  become  more  linear  for  stronger  pavements.  Over  the 
years  the  WES  has  collected  an  extensive  set  of  dynamic  load-deflection  curves 
that  have  been  obtained  on  many  airfield  pavements  throughout  the  country. 

The  nonlinear  dynamic  load-deflection  curves  were  measured  at  a fre- 
quency of  15  Hz  and  at  a static  surface  loading  of  16  kips.  The  nonlinear 
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dynamic  theory  must  account  for  the  frequency  and  static  load  conditions 
under  which  the  dynamic  load-deflection  curves  were  measured.  The  pre- 
dicted subgrade  modulus  should  be  free  of  the  particular  loading  charac- 
teristics of  the  vibrator.  Therefore,  in  addition  to  the  static  Young's 
modulus  some  other  parameters  have  to  be  introduced  which  will  account 
for  the  observed  nonlinearity  of  the  dynamic  load-deflection  curves. 

These  nonlinear  parameters  must  also  account  for  the  nonlinear  behavior  of 
the  static  load-deflection  curves.  The  predicted  subgrade  modulus  value 
will  be  independent  of  the  particular  loading  characteristics  of  the 
vibrator  — frequency,  static  load,  and  dynamic  load.  Only  the  natural 
overburden  pressure  will  be  reflected  in  the  subgrade  modulus  value. 

The  determination  of  the  elastic  constants  and  the  static  and  dynamic 
nonlinear  parameters  of  the  pavements  and  subgrades  from  measured  dynamic 

load-deflection  data  requires  a nonlinear  dynamic  theory  of  pavement 
4 

response  . 

EQUATION  OF  MOTION  OF  A NONLINEAR  OSCILLATOR 

The  nonlinear  theory  of  pavement  response  to  a vibratory  load  assumes 
that  the  pavement-subgrade  system  can  be  described  by  a lumped  mass  non- 
linear oscillator  whose  equation  of  motion  is  written  as 


mx  + Cx  + k.„x  + bx3  + ex5  = F + F 
00  D S 


where  m = effective  mass  of  the  pavement-subgrade  system,  x = total 
displacement  of  the  pavement  surface  beneath  the  vibrator  baseplate,  C = 
damping  constant,  k = linear  spring  constant,  b = third  order  non- 


linear pa. ament  parameter,  e = fifth  ortLr  nonlinear  pavement  parameter, 


« dynamic  load  applied  to  the  pavement  surface,  and  F = static  load 
applied  to  the  pavement  surface.  The  total  displacement  of  the  pavement 
surface  is  decomposed  into  a static  and  a dynamic  part  as  follows 


x * x + E 
e 


where  xg  = static  elastic  displacement  of  the  pavement  surface,  and  £ = 
dynamic  elastic  displacement  of  the  pavement  surface.  Placing  (13)  into  (12) 


gives  the  following  equation  of  motion 


m£  + C£  + |kQ0  + 3bx2  + 5ex4|c  + b£3  + e£5  + £g(xe,£)  = Fj, 

where 


g(xfi,C)  = 3bxgC  + 10ex3C  + 10ex2£2  + 5exgC3 


For  convenience  in  manipulating  (14)  it  is  necessary  to  use  a time 
averaged  expression  for  (15) 


g(xfi,C)  = ^a  jbx2  + 5a2ex£  + a3b£2  + a^^ 


where  t a2  , a3  and  a^  are  coefficients  to  be  determined  from  the  mea- 
sured dynamic  load-deflection  data.  Combining  v 16)  and  (14)  gives  the  motion 
equation  as 


mC  + C£  + kQE  + b6C3  + enC5  = FD 


where 


k.  “ k + 3be,x2  f 5ee  x4 
0 00  2 e 4 e 


(19) 

(20) 


0 " 1 + a3 

n - 1 + a4 

e2  ■ 1 + ax  (21) 

e4  “ 1 + a2  (22) 

The  parameters  0 , n » e2  and  depend  on  the  pavement  strength  and  are 

determined  by  requiring  (17)  to  adequately  describe  the  dynamic  load-deflection 
curves.  The  nonlinear  parameters  b and  e determine  the  static  load-deflec- 
tion curves,  as  can  be  seen  from  (12) 

F„  * k x + bx3  + ex5  (23) 

S 00  e e e 

In  general  it  is  found  that  b < 0 and  e > 0 for  pavements  and  most  sub- 
grades. 

THEORY  OF  DYNAMIC  LOAD-DEFLECTION  CURVES 

The  problem  remains  to  solve  the  nonlinear  equation  (17).  This  can 
be  done  by  casting  (17)  into  an  equivalent  linear  form  for  which  the  dynamic 
amplitude  is  given  by 

5 - Fd/S  (24) 

where 

S = J (k  - mw^)2  + (25) 

where  S - dynamic  stiffness,  k = dynamic  spring  constant,  m = effective 
mass,  a)  - angular  frequency  and  C = damping  constant.  The  requirement  that 

(24)  and  (25)  be  a solution  of  (17)  is  that  the  spring  constant  in  (25)  is 

4 

given  cy 
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k = k0  + | b0C2  + | en54  (26) 


Therefore  the  spring  constant  for  a nonlinear  system  depends  on  the  dynamic 
and  static  displacements  of  the  pavement  surface. 


Placing  (26)  into  (25)  and  (24) , and  solving  for  the  dynamic  amplitude 

4 

yields  the  result 

F 

£ = jr-  (1  + a,t|)  + a <p2  + •••) 

P 

(27) 

where 

S0  - J (kQ  - mu)2)2  + C 2a>2 

(28) 

* = fd/s4o 

(29) 

<*1  “ - f-  b0(ko  - m“2) 

(30) 

a2“lll)  b2e2(ko  - mm2)2  - S2  | ne.(k0  -nuo2)  + \ \ 

(|f  b2®2  (3D 

As  seen  from  (27)  - (30)  the  degree  of  nonlinearity  of  a dynamic  load- 
deflection  curve  depends  on  the  strength  of  the  pavement  and  the  frequency 
of  operation  of  the  vibrator.  The  strength  of  the  pavement  affects  the 
degree  of  nonlinearity  of  the  dynamic  load-deflection  curves  through  the 
term  S'1*  that  appears  in  (27)  and  (29).  The  S*1*  term  shows  that  strong 
pavements  tend  to  be  more  linear  than  weak  pavements.  From  (30)  it  is  clear 
that  there  is  a critical  frequency  for  which  the  first  order  nonlinear  term 
vanishes  and  this  frequency  is  given  by 
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At  this  frequency  the  dynamic  load-deflection  curves  should  become  especially 
linear  in  the  regions  of  low  dynamic  force  if  the  second  order  nonlinear  term 
is  comparatively  small.  The  straightening  effect  at  the  critical  frequency 
will  not  be  strongly  evident  if  the  second  order  nonlinear  term  is  compara- 
tively large. 

DYNAMIC  NATURE  OF  THE  SPRING  CONSTANT 


The  measurement  of  the  dynamic  load-deflection  curves  determine  the 
linear  and  nonlinear  parameters  of  a pavement  system  — kQQ  , b , e , 0 , 
n , e2  , e4  . Equation  (26)  shows  that  the  spring  constant  k that  is 
determined  from  a dynamic  analysis  of  the  nonlinear  properties  of  a pave- 
ment-subgrade system  is  dependent  on  the  dynamic  and  static  displacements 
of  the  pavement  surface  as  well  as  on  the  elastic  constants  of  the  pave- 
ment-subgrade system.  Therefore  the  spring  constant  k that  is  determined 
from  the  dynamic  response  of  a nonlinear  pavement  system  is  a dynamic 
quantity  that  is  not  analogous  to  an  ordinary  static  spring  constant.  The 
theoretical  static  spring  constant  determined  from  a static  linear  elastic 
program  such  as  the  Chevron  Program  will  depend  only  on  the  elastic  constants 
of  the  pavement.  Therefore  the  value  of  k determined  from  the  dynamic 
response  data  of  a nonlinear  pavement  cannot  logically  be  compared  to  the 
static  Kp  value  determined  from  static  layered  elastic  computer  programs. 
Static  plate  bearing  tests  will  result  in  a spring  constant  which  will  also 
not  be  directly  comparable  to  the  spring  constant  determined  from  an  analysis 


of  dynamic  data. 

FINITE  DEPTH  OF  INFLUENCE 


The  static  linear  and  nonlinear  parameters  kQQ  , b and  e respec- 
tively can  be  related  to  the  elastic  parameters  of  the  pavement  layers  and 
to  the  depth  of  influence  of  the  static  stress-strain  field4.  The  finite 
depth  of  influence  is  written  in  terms  of  the  static  deflection  of  the 
pavement  surface  as 


J-  = + /x2  + /x4 

u 2 e 4 e 


For  the  simplest  case  of  a vibrator  placed  on  the  surface  of  a subgrade, 
the  static  parameters  are 


4Tra2ip./(l  - v) G 

b 

- 2v) 


(34) 

(35) 


where 


(36) 


(37) 


and  ip  = volume  factor  for  the  frustum  of  the  cone  of  stress  and  strain.  It 
is  through  equations  similar  to  (34)  - (37)  that  the  connection  is  made  be- 
tween the  elastic  parameters  of  the  pavement  system  and  the  theoretical 


expression  for  the  dynamic  stiffness  as  given  by  (25)  and  (26). 
MODEL  PARAMETERS 


n 


The  model  parameters  k,m,C,k  ,b,e,.i^,  ^ , 0 > 

n , e2  and  e4  depend  on  vibrator  characteristics  and  on  the  structure  of 
the  pavement  and  subgrade.  This  dependence  is  in  general  very  complicated 
and  difficult  to  determine  theoretically.  The  simplest  way  to  attach  the 
model  parameters  to  the  strength  of  a pavement-subgrade  system  is  to  deter- 
mine these  parameters  in  terms  of  the  measured  dynamic  stiffness  modulus 
(DSM)  of  a pavement.  The  DSM  is  the  slope  of  the  load-deflection  curve 
measured  by  the  WES  16-kip  vibrator  in  the  region  of  large  dynamic  load; 
it  is  in  fact  the  tangent  modulus  of  the  dynamic  load-deflection  curves  for 
15  kips.  The  DSM  value  is  a suitable  choice  for  a parameter  in  terms  of 
which  to  describe  the  model  parameters  because  it  is  a measure  of  the  bulk 
strength  of  the  pavement  and  subgrade.  The  model  parameters  expressed  in 
terms  of  the  measured  DSM  correspond  to  the  WES  16-kip  vibrator.  The 
vibrator  characteristics  appear  in  these  parameters  because  the  subgrade 
modulus  to  be  determined  is  intended  to  be  independent  of  the  dynamic  char- 
acteristics of  the  vibrator.  A corresponding  set  of  vibrator  parameters 
will  have  to  be  developed  for  any  other  vibrator  that  is  to  be  used  for 
nondestructive  testing  of  pavements. 

The  model  parameters  are  presented  as  a function  of  the  measured  DSM  in 
Figs.  6 through  15.  From  these  figures  it  is  seen  that  k , m , C and 
are  increasing  functions  of  the  strength  of  the  pavement.  The  dynamic  spring 
constant  presented  in  Fig.  6 corresponds  to  a dynamic  load  of  15  kips.  The 
depth  of  influence  of  the  static  stress-strain  field  increases  with  increasing 
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pavement  strength  while  the  static  deflection  of  the  pavement  surface 
under  a fixed  static  load  decreases  with  increasing  strength.  As  seen 
from  Fig.  7 the  effective  mass  is  generally  much  larger  than  the  above- 
surface mass,  and  it  would  be  incorrect  to  assume  that  the  only  lumped- 
mass  of  the  vibrator-pavement-subgrade  system  is  the  vibrator  mass  itself. 
The  effective  mass  of  the  dynamic  model  includes  the  inertial  effects  of 
the  mechanical  radiation  field  in  the  pavement  and  subgrade.  In  all  cases 
of  the  pavements  investigated  it  was  found  that  b < 0 and  e > 0 . 
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DETERMINATION  OF  SUBGRADE  MODULUS  FROM  DYNAMIC  LOAD-DEFLECTION  CURVES 

The  nonlinear  dynamic  response  model  that  has  been  outlined  in  the 
preceding  section  can  be  used  in  conjunction  with  a dynamic  load-deflection 
curve  measured  at  the  pavement  surface  to  determine  the  modulus  of  the  sub- 
grade beneath  the  pavement.  A computer  program  has  been  developed  which 
calculaces  the  theoretical  dynamic  response  of  a pavement  in  terras  of  the 
elastic  moduli  of  the  pavement  layers  and  subgrade  and  in  terms  of  the 
empirically  determined  parameters  0 > D , , m and  C which  have 

been  expressed  in  terms  of  the  measured  DSM  values  of  the  pavement.  A 
typical  example  of  the  vibratory  nondestructive  input  data  to  the  computer 
program  is  shown  in  Table  II.  The  computer  program  calculates  a theoretical 
load-deflection  curve  in  terms  of  the  b and  e coefficients  that  are 
determined  from  measured  load-deflection  curves  and  in  terms  of  the  elastic 
moduli  of  the  pavement  layers  and  the  subgrade.  The  elastic  moduli  of  the 
pavement  layers  are  selected  from  laboratory  tests  and  CBR  measurements. 

The  subgrade  modulus  is  then  determined  by  requiring  that  the  theoretically 
predicted  dynamic  load-deflection  curve  agree  with  the  measured  dynamic  load- 
deflection  curve.  This  procedure  for  determining  the  subgrade  modulus  is 
shown  in  Fig.  16.  The  numerical  results  of  this  procedure  for  a few  pavement 
sites  are  presented  in  Table  III.  The  values  of  the  subgrade  modulus  pre- 
dicted by  the  nonlinear  dynamic  response  theory  are  in  general  agreement  with 
those  predicted  by  the  empirical  relation  Eg  = 1500  CBR. 

IV.  LABORATORY  CONFIRMATION  OF  VIBRATORY  NONDESTRUCTIVE  FIELD  TEST  DATA 
It  is  of  interest  to  be  able  to  correlate  the  laboratory  value  of  the 
resilient  modulus  M^  of  a soil  sample  taken  from  the  subgrade  at  a pavement 
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or  soil  site  for  which  the  subgrade  modulus  has  been  predicted  by  the  vibratory 
nondestructive  testing  method.  Such  a correlation  is  difficult  to  achieve 
because  the  loading  conditions  on  the  soil  sample  for  the  laboratory  tests  are 
different  from  the  loading  conditions  on  the  subgrade  during  vibratory  non- 
destructive testing.  The  loading  conditions  differ  in  terms  of  the  magnitude 
of  the  static  and  dynamic  stresses  and  in  terms  of  the  frequency  of  application 
of  the  dynamic  stress. 

In  its  natural  state,  an  element  of  soil  in  the  subgrade  is  subjected 
only  to  the  overburden  pressure.  When  a vibrator  is  operated  on  the  surface 
of  a pavement  or  subgrade,  an  additional  static  and  dynamic  stress  is  applied 
to  an  element  of  soil  in  the  subgrade.  For  the  WES  16-kip  vibrator  the  static 
load  applied  to  the  surface  is  16  kips,  while  the  dynamic  load  can  be  varied 
up  to  15  kips  and  is  applied  sinusoidally  with  a frequency  of  15  Hz.  The 
stress  field  in  the  subgrade  is  nonuniform  and  can  be  calculated  by  standard 
elasticity  theory. 

The  laboratory  sample  for  resilient  modulus  testing  is  cylindrical  in 
shape  with  a typical  diameter  of  3 inches  and  a length  of  6 inches.  The 
cylindrical  sample  is  subjected  to  a static  confining  pressure  and  then  a 
dynamic  load  is  applied  in  the  axial  direction.  The  stress  is  uniform  along  the 
axis  of  the  laboratory  sample.  The  total  stress  along  the  axis  of  the  labor- 
atory sample  is  written  as 


o = oD  + as 


(38) 


where  = dynamic  stress  in  axial  direction  of  sample,  and  a = confining 
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pressure.  The  axial  dynamic  stress  is  also  called  the  deviator  stress  and 
is  written  as  a = a - a , where  a =*  total  stress  along  the  axis  of  the 

1/  u 

specimen.  The  resilient  modulus  has  been  measured  for  a number  of  soil 

and  pavement  materials , and  has  been  found  to  depend  on  Og  and  . 

The  dependence  of  on  the  dynamic  deviator  stress  is  such  that  at 

first  decreases  with  increasing  values  of  , attains  a minimum  value,  and 

then  increases  with  further  increase  of  the  deviator  stress^. 

The  dynamic  stress  acting  along  the  axial  direction  of  the  soil  specimen 

during  the  laboratory  resilient  modulus  test  is  applied  as  a series  of  pulses 

in  the  form  of  haversines  with  a pulse  of  1 second  duration  being  applied 

/ 

every  3 seconds.  The  characteristic  frequency  of  the  dynamic  loading  on  the 
sample  will  therefore  be  in  the  range  of  0.3  - 1.0  Hz,  and  this  is  much  lower 
than  the  frequency  of  15  Hz  at  which  the  vibratory  nondestructive  field  tests 
are  conducted.  The  large  difference  in  the  frequencies  used  for  these  two 
types  of  tests  requires  that  an  adequate  account  of  frequency  effects  be 
included  in  the  theoretical  analysis  of  both  laboratory  and  field  vibratory 
tests. 

NONLINEAR  DYNAMICAL  ANALYSIS  OF  THE  RESILIENT  MODULUS  TEST 


A dynamical  theory  of  the  resilient  modulus  test  has  been  developed  which 
is  similar  in  form  to  the  analysis  developed  for  the  vibratory  nondestructive 
field  tests.  The  basic  result  of  this  theory  is  that  the  dynamic  displacement 
of  the  test  specimen  can  be  written  as 


e ■ Vs  ■ AcVs 


S * /(k  - mu2)  + 


where  £ , and  = resilient  dynamic  displacement,  dynamic  load,  and 
dynamic  stress  on  the  cylinder  end  in  the  axial  direction;  S , k , m , C 
and  A » dynamic  stiffness,  spring  constant,  effective  mass,  damping 
constant,  and  area  of  loaded  end  of  the  cylinder  respectively;  w = effective 
angular  frequency  component  of  the  dynamic  load  applied  to  the  soil  sample. 

The  nonlinear  theory  of  vibrations  that  was  outlined  earlier  in  this 
paper  for  the  vibratory  nondestructive  field  tests  can  also  be  used  to  calculate 
the  quantities  in  (39)  and  (40) . This  nonlinear  theory  shows  that  the  spring 
constant  is  given  by 

k = kQ  + |b9£2  + (41) 


k„  = + 3be  x2  + 5ee, x4 
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where  b , e , 6 , n , e2  and  e^  = parameters  which  characterize  the  soil 

sample,  and  xg  = resilient  static  displacement  of  the  soil  sample  in  the 

axial  direction.  The  coefficients  k , b and  e could  be  determined  from 

00 

the  resilient  static  stress-strain  curve  if  such  a curve  could  be  measured. 
The  resilient  static  stress-strain  curve  of  the  soil  sample  is  determined  by 


F„  * ocA_  - k„x  + bx3  + ex5  (4^ 

SSCOOeee 

where  o<,  » static  confining  pressure,  and  Fg  * total  static  force  applied 
to  the  cylinder  end. 

4 

The  solution  of  (39)  - (42)  can  be  written  as 
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£ - g-|l  + Clj*  + a2^2 


where 


^(k  - mu)2)2  + C2u>2 
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a1  * - ^b0(ko  - no)2) 
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The  dynamic  stiffness  of  the  soil  sample  car.  be  obtained  from  (39)  and  (44) 


to  be 


S = SQ(1  + Bj*  + B.*2) 
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The  quantities  necessary  for  the  calculation  of  the  resilient  modulus  have 
now  been  determined. 


CALCULATION  OF  THE  RESILIENT  MODULUS 

The  resilient  modulus  is  defined  as  the  slope  of  the  unloading  portion  of 

the  dynamic  stress-strain  curve  of  the  soil  sample,  snd  is  given  by 
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(52) 


where  = dynamic  strain  in  axial  direction,  L * length  of  the  soil  sample, 

and  A„  - area  of  end  of  the  cylindrical  sample.  In  (52)  £ is  assume!  to 

v 

describe  the  unloading  portion  of  the  resilient  dynamic  load-deflection  curve 
of  the  soil  sample.  Combining  (44)  and  (52)  gives 


M = M f 
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(53) 


where 
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For  the  low  frequency  and  small  mass  with  which  the  resilient  modulus  tests 
are  conducted,  the  inertial  and  damping  terms  in  (40)  and  (45)  can  be  neglected 
and  the  following  approximations  can  be  made 


The  same  approximations  can  be  made  in  (47)  and  (48) . 
(58)  gives  the  following  approximation 


(57) 

(58) 

Combining  (42)  , (56)  and 
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where 


E = — V 
b0  Ac  *0  0 


(60) 
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(61) 


E4  - Tr  5eG4 
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(62) 


The  quantities  Eq  , E^  and  E^  are  soil  parameters  which  are  independent 
of  the  size  of  the  soil  sample  and  machine  characteristics.  The  calculation 
of  the  resilient  Poisson's  ratio  requires  further  study. 

The  expression  for  given  by  (53)  - (56)  characterizes  the  resilient 

modulus  in  terms  of  and  u . The  parameters  required  to  describe 

the  resilient  modulus  are  kQQ  , b , e , 6 , n , , m , L , . These 

parameters  will  depend  on  the  type  of  testing  machine,  size  of  soil  sample,  and 
the  tvpe  of  soil  constituting  the  soil  sample;  and  therefore  the  parameters 
will  have  to  be  determined  for  each  type  of  testing  machine.  Typ -cal  values  of 
the  parameters  describing  a resilient  modulus  test  as  described  by  (39)  - (62) 
are  given  for  lean  clay  in  Table  IV.  It  is  possible  to  determine  resilient 
modulus  parameters  which  are  independent  of  the  size  of  the  soil  sample  and 
independent  of  the  type  of  testing  apparatus.  The  parameters  E^  , E^  and  E^ 
that  occur  in  (60)  - (62)  are  soil  parameters  and  are  independent  of  the  sample 
size  or  loading  conditions.  It  is  the  quantity  E^  that  must  be  compared  with 
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the  value  of  E„  determined  from  the  vibratory  load-deflection  curves  that 
were  measured  directly  on  the  subgrade.  The  value  of  Eg  was  determined  in 
a manner  such  that  its  value  is  independent  of  the  static  and  dynamic  loads 
exerted  by  the  vibrator. 

The  preceding  analysis  shows  that  the  characteristic  shape  of  the  non- 
linear dynamic  load-deflection  curves  measured  in  the  field  by  the  WES  16-kip 
vibrator  is  due  in  part  to  the  basic  nonlinear  response  of  the  material  in  the 
subgrade  to  dynamic  loads.  The  signs  of  the  coefficients  describing  the  resil- 
ient modulus  test:  >0  , a2  > 0 ,61<0,62>0,b<0,  and  e > 0 , 

determine  to  a large  extent  the  signs  of  the  corresponding  coefficients  deter- 
mined from  the  vibratory  nondestructive  tests  conducted  on  pavements  and  sub- 
grades. However,  inertial,  damping  and  frequency  effects  will  affect  the 
values  of  and  a that  are  determined  by  vibratory  nondestructive  test- 

ing. For  the  vibratory  nondestructive  tests  done  on  pavements  and  subgrades  at 
15  Hz,  it  is  generally  found  that  oij  > 0 and  <*2  > 0 which  is  in  agreement 
with  the  signs  of  the  corresponding  coefficients  describing  the  resilient 
modulus  laboratory  test.  For  frequencies  different  from  15  Hz  and  for  excep- 
tional pavement  cases  it  is  found  that  > 0 and  <*2  < 0 or  a1  < 0 and 
a2  > 0 . Therefore,  the  combination  of  the  large  effective  mass  associated 
with  a pavement  and  subgrade,  and  the  relatively  high  frequency  of  operation 
of  the  WES  16-kip  vibrator  can  produce  a dynamic  load-deflection  curve  which 
has  a shape  which  is  considerably  different  from  the  shape  of  the  dynamic  load- 
deflection  curve  measured  in  the  laboratory  during  3 resilient  modulus  test. 

Because  of  the  finite  size  of  the  soil  sample  for  the  resilient  modulus 
test,  the  effective  mass  of  the  soil  sample  is,  to  a good  approximation,  equal 
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Co  the  actual  mass  of  the  sample.  The  effective  mass  that  enters  the  dynamical 
calculations  for  the  vibratory  nondestructive  field  tests  Is  generally  quite 
large  compared  to  the  moving  mass  of  the  vibrator  because  of  the  large  inertial 
effects  associated  with  the  pavement  and  subgrade.  The  large  effective  mass 
and  high  frequency  of  the  vibratory  nondestructive  field  tests  indicate  that 
the  inertial  and  damping  terms  are  comparable  or  larger  than  the  elastic  effects, 
mm2  % k and  Cio  •v  k . The  relatively  small  mass  of  the  soil  sample  used  for 
the  laboratory  resilient  modulus  tests  and  the  low  frequency  at  which  these 
tests  are  conducted  suggest  that  for  this  case,  mu2  <<  k and  Cu>  « k , and 
the  linear  and  nonlinear  elastic  properties  are  measured  directly  in  this  test. 

The  resilient  modulus  tests  combined  with  the  nonlinear  dynamical  theory 
of  these  tests  indicate  that  the  static  nonlinear  elastic  coefficients  b and 
e have  the  signs  b < 0 and  e > 0 . It  is  this  basic  property  of  soils  that 
is  responsible  for  making  the  corresponding  coefficients  determined  from  field 
tests  exhibit  the  same  signs.  It  is  the  nonzero  values  of  b and  e as  deter- 
mined from  the  resilient  modulus  that  are  responsible  for  the  finite  depth  of 
influence  of  the  static  stress-strain  field  in  the  subgrade  beneath  a static 
load  placed  on  the  pavement  surface.  The  intrinsic  nonlinearity  exhibited  by 
the  soil  during  the  resilient  modulus  tests  is  responsible  for  the  finite  depth 
of  influence  of  the  static  stress-strain  field  in  an  actual  soil  formation. 


V.  CONCLUSION 

The  nonlinear  dynamic  pavement  response  model  that  is  presented  in  this 
paper  gives  a quantitative  description  of  the  dynamic  response  of  a pavement 
surface  unuer  the  action  of  the  dynamic  and  static  load  applied  to  the 
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pavement  surface  by  the  WES  16-kip  vibrator.  The  model  parameters  - spring 
constants,  effective  mass,  damping  constant  and  finite  depth  of  influence  of 
the  static  load  have  been  determined  as  a function  of  pavement  strength  as 
represented  by  the  measured  DSM.  The  nonlinear  pavement  response  model  gives 
a theoretical  expression  for  the  pavement  response  in  terms  of  these  parameters 
and  in  terms  of  the  elastic  constants  of  the  pavement  and  subgrade.  For  a 
suitable  choice  of  the  elastic  moduli  of  the  pavement  layers,  it  is  possible 
to  predict  the  value  of  the  subgrade  modulus  from  the  dynamic  load-deflection 
curve  measured  at  the  pavement  surface. 

Of  much  importance  to  pavement  engineers  is  an  estimation  of  the  strength 
and  condition  of  a subgrade  as  measured  by  its  subgrade  modulus.  The  nonlinear 
elastic  response  model  of  the  dynamic  load-deflection  curve  combined  with 
measured  values  of  this  curve  is  sufficient  to  determine  the  subgrade  elastic 
modulus  quickly  and  accurately.  This  work  was  funded  by  the  Federal 
Aviation  Administration. 
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TABLE  II 


INPUT  OF  WES  NONLINEAR  DYNAMIC  PROGRAM 
SITE  B2A 

DSM  - 700  kips/in 


TABLE  III 

RESULTS  OF  WES  NONLINEAR  DYNAMIC  PROGRAM 


— 

SITE 

DSM 

kips/in 

SUBGRADE  CBR 

Eg (CBR) 
103psi 

Eg (WES  NONLINEAR) 
103psi 

TETS 

450 

8 

12.0 

13.0 

B2A 

700 

14 

21.0 

22.8 

NI8 

770 

18 

27.0 

25.9 

WES-AC 

780 

4 

6.0 

6.7 

W1 

860 

20 

30.0 

18.8 

N23A 

980 

18 

27.0 

28.1 

B3 

1680 

17 

25.5 

11.1 

W2C 

1940 

20 

30.0 

35.5 

P14A 

2120 

20 

30.0 

13.7 

P13 

2780 

20 

30.6 

17.7 

B1 

3120 

14 

21.0 

9.0 

WES-PCC 

3500 

4 

6.0 

6.8 

! 


r ^ 

a 

TABLE  IV 

PARAMETERS  DESCRIBING  THE  DYNAMIC  CHARACTERISTICS 
OF  THE  RESILIENT  MODULUS  LABORATORY  TEST 


Ac 

in2 

6.16 

61 

lb2/in4 

-5.66  x 1013 

L 

in 

6.0 

6 

2 

lb4/in8 

1.45  x 1027 

W 

lb 

5.0 

“l 

lb 2 /in4 

1.89  x 1013 

m 

lb  sec2/in 

0.013 

a2 

lb4/in8 

3.5  x 1026 

m 

sec-1 

6.0 

ei 

lb2/ in4 

-1.89  x 1013 

mu)2 

lb /in 

0.468 

e2 

lb4/in8 

7.2  x 1024 

C 

lb  sec/in 

30.0 

0 

dimensionless 

30.0 

Cu) 

lb/in 

180.0 

n 

dimensionless 

50.0 

o 

o 

M 

lb /in 

1.5  x 104 

G2 

dimensionless 

31.0 

k 

0 

lb /in 

4.0  x 104 

S 

dimensionless 

54.0 

k 

lb /in 

4.0  x 104 

Eo 

lb /in2 

1.5  x 104 

b 

lb/in3 

-2.0  x 107 

E 

2 

lb /in4 

-2.0  x 109 

e 

lb/ in5 

3.6  x 10n 

E 

4 

lb/in6 

9.7  x 1013 

Figure  1.  Typical  frequency  response  curve 
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Figure  2.  Typical  dynamic  load-deflection  curve 


DYNAMIC  PAVEMENT  RESPONSE  MODELS 
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1.  LINEAR  SPRING 

2.  NONLINEAR  SPRING 


Figure  3.  Single  and  double  mass  dynamic  pavement 
response  models 


FREQUENCY 


MEASURED  QUANTITIES:  fR 

A„ 


RESONANCE  FREQUENCY 

DYNAMIC  AMPLITUDE  AT  RESONANCE 

Ar/A  = RATIO  OF  AMPLITUDE  AT 
RESONANCE  TO  AMPLITUDE 
AT  ARBITRARY  FREQUENCY. 


Figure  L.  Measured  quantities  obtained  from  frequency 
response  curves 


FREQUENCY  RESPONSE  METHOD 
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fR  FREQUENCY 

INPUT  DATA 
fR  Ar  J = ar/a 

' 

WES  DYNAMIC  FREQUENCY 
RESPONSE  PROGRAM 

OUTPUT  DATA 
kmc 


BALTIMORE  1 

(B2) 

T 

77°  F 

AC 

E,  2.0 

*10s 

"l 

0.30 

H, 

5 

BLACK 

BASE 

E2  = 2.0 

xlO5 

0.35 

h2 

= 7 

GW-GM 

Ej  = 1.0 

x 105 

"3  = 

0.35 

H3 

= 9 

CHEVRON  LAYERED 
LINEAR  ELASTIC 
PROGRAM 


l ; 

rn 
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Es  = 65>10!PSI 
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Figure  5.  Method  of  calculating  subgrade  modulus  from 
measured  frequency  response  curves 

455 


r 


Figure  7.  Effective  mass  ratio  versus  measured  dynamic  stiffness 


Figure  8.  Damping  constant  versus  measured  values  of  DSM 


Leading  term  of  the  finite  depth  of  influence  versus 
measured  values  of  DSM 


Figure  13.  Dimensionless  parameter  0 versus  measured  values  of 

values  of  DSM 
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Critical  frequency  versus  measured  DSM 
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DYNAMIC  LOAD -DEFLECTION  METHOD 


BALTIMORE  (B2) 
T = 77°  F 

AC 

Ej  = 2.0  * 105  vx  - 0.30  Hj  = 5 

BLACK 

BASE 

E2  = 2.0  x 105  v2  0.35  H?  = 7 

GW-GM 

Ej  = 1.0  <105  i/3  = 0.35  H3  = 9 

INPUT  DATA 

1.  DSM  VALUE 

2.  POINT  BY  POINT  TABULATION 
OF  LOAD- DEFLECTION  CURVE 


WES  NONLINEAR  DYNAMIC 
LOAD- DEFLECTION  PROGRAM 


I 


OUTPUT  DATA 
k M C Es 


Es  = 22.8  * 103  PSI 


Figure  16.  Determination  of  subgrade  modulus  from  measured 
dynamic  load-deflection  curves 
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CHARACTERIZATION  OF  BEHIND  ARMOR  EFFECTS  FOR 
LONG  ROD  PENETRATORS 


Victor  D.  Maki 
Engineering  Branch 
Ballistic  Modeling  Division 
US  Army  Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  Maryland 


ABSTRACT.  This  study  was  needed  to  provide  information  on  the 
behind  armor  effects  essential  to  armored  vehicle  analysis  and  in  the 
design  of  future  kinetic  energy  penetrators.  Both  spall  and  rod  pene- 
trator  fragment  data  was  examined  for  gross  characteristic  statistical 
trends.  Use  of  least  squares  was  employed  to  ascertain  causes  for  simi- 
larities in  the  data  base.  A linear  function  relating  fragment  mass  to 
velocity  was  employed  to  study  effects  of  variation  in  projectile  materials, 
initial  projectile  weights,  striking  velocities,  length  to  diameter  ratios 
and  plate  thicknesses.  Kolmogorov-Smirnov  type  test  statistics  were  used 
to  determine  whether  or  not  a unique  parent  weight  distribution  existed 
between  various  firings.  The  Weibull,  Poisson  and  Truncated  Normal  cumu- 
lative distribution  functions  were  also  compared  with  empirical  weight 
distributions  for  several  selected  firings.  This  paper  summarizes  the 
characteristics  found. 


1.  INTRODUCTION.  Whenever  armored  vehicles  of  any  kind  are  attacked 
by  metal  rod  penetrators,  fragments  are  spra/ed  inside  the  vehicle  which 
damage  components  and  personnel.  To  facilitate  a greater  understanding 
of  those  mechanical  processes  involved,  a gross  characterization  was  done 
that  includes  fragment  numbers,  mass  distributions,  and  spatial  locations 
behind  6.35  and  12.7  millimeter  rolled  homogenous  steel  targets.  The 
fragment  data  base  used  for  this  analysis  is  comprised  of  140  test  firings 
completed  at  the  BRL  in  1970.  In  the  data  base  projectile  weights,  length 
to  diameter  ratios,  projectile  material  types, target  plate  thickness, 
fragment  masses,  fragment  locations,  (see  Figure  1 ) , and  velocities  were 
found  recorded  in  a BRL  Memorandum  Report1.  This  data  was  transcribed 
onto  IBM  punched  cards  for  computer  reduction  and  analysis.  Initially, 
the  natural  log  of  fragment  mass  and  velocity  were  fitted  with  a first 
degree  polynomial  of  the  form,  y = a^  + a^x  where  y denotes  the  In  velocity 

parameter  and  x the  In  of  fragment  mass.  This  polynomial  fitting  and 
plotting  technique  was  later  published  by  the  author  as  a BRL  Systems 

^L.  Herr  and  C.  Gabarek,  "Ballistic  Performance  and  Beyond  Armor  Data  for 
Rods  Impacting  Steel  Armor  Plates,"  US  Army  Ballistic  Research  Labora- 
tories Memorandum  Report  #2575,  January  1976. 
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Programming  Bulletin2.  The  volume  of  linear  equations  and  plots  produced 
was  found  to  be  valuable  as  a convenient  index  in  a search  for  trends  from 
firing  to  firing  which  later  led  to  a zone  analysis  of  the  data.  A zone 
definition  and  the  results  of  the  zone  analysis  are  on  the  following  page. 

Zone  number  1 is  represented  by  the  innermost  circle  on  the  recovery 
media  surface  and  is  measured  by  an  angle  of  ten  degrees  with  respect  to 
the  shotline.  Zones  2-t.hru-5  are  defined  by  an  angle  increase  of  ten 
degrees  per  zone.  For  all  firings  the  shotline  was  orthogonal  to  the 
target  surface  plane.  Projectile  striking  velocities  were  in  the  900  to 
1500  meters/sec  range.  The  straight  line  function.  In  (fragment  velocity) 
a^  + a.  (In  fragment  mass)  when  fitted  on  a zone  per  zone  basis  revealed 

a distinct  trend.  As  zone  angle  increased,  the  slope  values,  a^'s  were 
more  negative  in  value.  This  agrees  with  the  basic  conservation  of  energy 
law  of  physics. 

2.  The  Weibull  Distribution  Function.  In  a testing  of  the  Poisson, 
Truncated  Normal,  and  Weibull  distribution  functions  the  latter  provided 
the  best  fit  to  the  fragment  mass  parameter.  A detailed  report  of  how 
the  Weibull  distribution  function  parameters  were  estimated  can  be  found 
in  Reference  3.  A two  sided  Kolmogorov-Smirnov  type  test  was  employed  as 
a criteria  for  best  fit.  The  empirical  cumulative  distribution  function, 
(Equation  1)  was  computed  for  the  fragment  mass  parameters  for  several 
selected  firings.  A graphical  and  numerical  comparison  with  the  Weibull 
cumulative  distribution  function,  (Equation  2)  was  then  performed. 

°,  x < x(1) 

K/N,  x^  <_  x < x 

X(N)  ±X 

xY 

F(x)  = 1.0  - exp  - (— ) 

9 


(1) 


(2) 


x,  is  the  fragment  mass  parameter. 


2 

Victor  D.  Maki,  "PDT  Plot  Subroutine  with  Bi-variate  Analysis,"  US  Array 
Ballistic  Research  Laboratories  Systems  Programming  Bulletin  #SPB-G-74, 
17  July  1974. 


3 

Victor  D.  Maki,  "Three  Probability  Density  Function  FORTRAN  Subroutines," 
US  Army  Ballistic  Research  Laboratories  Interim  Memorandum  Report  #396, 
June  1975. 
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The  computed  maximum  absolute  difference  was  numerically  compared  with, 

1.36//N  which  is  fully  described  in  Reference  4.  If  the  computed  maximum 
absolute  difference  was  found  to  be  less  than  the  above  statistic,  a 
decision  was  made  to  accept  the  Weibull  distribution  function  for  describing 
fragment  mass.  Included  in  this  paper  is  a plot  of  this  type  test  for 
round  number  5 (see  Figure  2).  For  "good"  fitting  of  the  Weibull  distribu- 
tion function  to  a large  number  of  firings  fragment  masses  greater  than 
100  grains  should  be  ignored.  Because  rod  penetrator  fragment  mass  distri- 
butions are  characteristically  bi-modal,  more  than  90  percent  of  the  frag- 
ments can  be  found  in  the  first  mode  and  therefore,  for  this  data  set, 
ignoring  the  second  mode  caused  no  significant  loss  in  accuracy. 

3.  ZONE  DEFINITION. 


Snotl ine 


v Armor  Plat 


Collection  Media 


Fragment  Impact 
Location 


Figure  1 

Zone  1 is  represented  by  the  innermost  circle  and  forms  an  angle  of 
ten  degrees  measured  from  the  shotline.  Zones  2-thru-5  are  defined  by 
an  angle  increase  of  ten  degrees  per  zone. 


W.  J.  Conover,  Practical  Non-Parametric  Statistics,  published  by 
John  Wiley,  1971,  New  York,  NY. 


MASS . ( GRS  * ) 


Figure  2 


1 . 36/^N  = .09866477 

Since  the  maximum  absolute  difference  is  smaller  than  the  above  value 
the  Weibull  distribution  is  accepted  for  this  firing. 
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The  fitting  of  the  Weibull  distribution  to  spall  fragment  mass  is  doc- 
umented in  Reference  5.  Further  interest  on  the  readers  part  on  this 
topic  should  be  directed  to  Mr.  John  Misey,  US  Army  Ballistic  Research 
Laboratory,  Aberdeen  Proving  Ground,  Maryland  21005. 


5,  CONCLUSIONS  OF  ANALYSIS. 

a.  As  zone  angle  increases,  average  fragment  velocity  decreases, 
numbers  of  fragments  decrease  and  average  mass  increases. 

b.  A two-sided  Kolmogorov-Smirnov  type  test  shows  the  Weibull  dis- 
tribution function  is  a good  choice  for  describing  fragment  mass  less  than 
100  grains. 

c.  Rod  penetrator  fragment  mass  distributions  are  characteristically 
bi-modal . 


"Behind  Armor  Data  for  Long  Rod  Penetrators , " paper  presented  by 
Mr.  John  Misey  at  the  Second  Annual  Automatic  Cannon  Caliber  Munitions 
Symposium,  25  September  1975  at  Frankfort  Arsenal. 
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MATHEMATICAL  MODELS  OF  SYSTEMS  AND  TACTICS  IN  LAND  COMBAT 


Roger  F.  Willis 

US  Army  TRADOC  Systems  Analysis  Activity 
White  Sands  Missile  Range,  New  Mexico  88002 


ABSTRACT.  This  paper  covers  a variety  of  mathematical  models  that  have 
recently  been  developed,  tailored  to  specific  decision  problems  in  tactics 
and  alternative  system  tradeoffs.  These  models  emphasize  rapid  and  flexible 
variation  of  assumptions,  investigation  of  alternative  tactics,  tradeoffs 
between  system  parameters,  tradeoffs  between  the  elements  of  a force  and 
various  optimizations.  Alternative  mathematical  formulations  include  linear 
versus  non-linear,  constant  versus  time-varying  coefficients  and  stochastic 
versus  deterministic. 

1.  INTRODUCTION 

Flexible  and  efficient  mathematical  models  are  required  for  use  in  different 
phases  of  a particular  force  evaluation  or  combat  developments  study.  In  an 
early  phase  these  models  can  be  used  to  compare  and  screen  alternatives  -- 
alternative  systems,  alternative  tactics  or  alternative  mixes.  In  late  phases 
the  same  models  can  be  used  for  sensitivity  analysis,  to  give  approximate 
answers  to  "what  if"  questions  --  i.e.,  to  determine  how  study  results  might 
change  if  certain  assumptions  are  varied.  In  most  studies  the  major  analyt- 
ical tool  will  be  a large,  relatively  slow  and  expensive  computer  model  or 
simulation  or  computer-assisted  wargame  (e.g.,  DIVWAG).  The  mathematical 
models  presented  in  this  paper  are  intended  to  supplement  the  large  models, 
to  provide  additional  insights  and  to  enrich  the  study  results.  These  models 
can  also  be  used  to  develop  hypotheses  (e.g.,  about  the  relative  merits  of 
alternative  tactics)  that  can  then  be  tested  with  high  resolution  stochastic 
simulations. 

2.  We  consider  the  class  of  models  consisting  of  sets  of  ordinary  differential 
equations  in  which  each  equation  represents  the  time  rate  of  change  of  the 
number  remaining  of  a particular  type  of  weapon.  The  equations  can  be 
deterministic  or  stochastic,  linear  or  non-linear,  with  constant  coefficients 
or  variable  coefficients.  We  consider  ten  specific  decision  problems  and 
the  mathematical  categories  of  models  as  follows: 

a.  deterministic,  linear,  constant  coefficients 

(1)  tradeoff  between  ground  forces  and  aircraft 

(2)  remotely  piloted  vehicles 

(3)  air  defense  suppression 


473 


(4)  optimum  artillery  mix 

b.  deterministic,  linear,  variable  coefficients 

(1)  antiamior  target  priorities 

(2)  optimum  disengagement  time 

(3)  electronic  warfare 

c.  deterministic,  non-linear 

(1)  weapon  effectiveness 

(2)  force  required 

d.  stochastic 

(1)  time  to  achieve  goal 

3.  More  complete  statements  of  the  decision  problems  are: 

a.  To  what  extent  can  tanks  be  traded  off  for  close  support  aircraft? 

b.  How  many  remotely  piloted  vehicles  are  required  to  support  one 
maneuver  battalion? 

c.  How  should  artillery  fire  be  allocated  between  counterbattery  fire 
and  suppression  of  air  defense? 

d.  What  is  the  optimum  mix  of  artillery  types  and  numbers  for  support- 
ing a mechanized  infantry  division? 

e.  How  should  tank  fire  be  allocated  between  three  or  four  types  of 
anti -armor  weapons? 

f.  What  is  the  optimum  time  for  a defending  anti-armor  force  to  disengage? 

g.  What  is  the  payoff  from  detecting  enemy  surveillance  systems  and 
attacking  or  jamming  them? 

h.  What  weapon  effectiveness  is  required  against  a given  enemy  force: 

(1)  if  replacements  are  available? 

(2)  if  no  replacements  are  available? 
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i.  What  force  size  is  required  against  a given  enemy? 


[ 

i 


I * * 

m 


j.  With  a given  force  available,  how  much  time  would  be  required  to 
reduce  an  enemy  force  to  a specified  level? 

4.  In  this  paper  we  will  present  models  for  only  four  of  these  decision 
problems:  a,  e,  i and  j.  These  particular  examples  were  selected  to  illustrate 

the  four  categories  of  models  and  several  different  measures  of  effectiveness. 

In  the  first  problem,  involving  tradeoffs  between  ground  and  air,  we  are 
interested  in  the  broader  question  of  what  mix  of  ground  forces  and  air  forces 
do  we  need  in  NATO?  What  factors  should  be  incorporated  in  a simple  model 
designed  to  give  gross,  order-of-magnitude  answers  to  this  question?  Some 
of  them  are:  aircraft  availability  rate  and  sortie  rate,  attrition  of 

aircraft,  allocation  of  aircraft  against  alternative  target  types  (e.g.,  tanks 
or  artillery),  lethality  of  air-delivered  weapons,  tank  effectiveness,  tank 
vulnerability  and  artillery  effectiveness.  The  model  is  presented  in  Figure  1, 
with  variables  and  factors  defined  as  follows: 

(t)  = Red  tanks 
X2(t)  = Red  artillery 
Y.|  (t)  = B1  ue  tanks 
Y2(t)  = Blue  artillery 
Y2(t)  = Blue  aircraft 

J = rate  at  which  a Blue  tank  can  kill  Red  tanks 

K = rate  at  which  a Red  tank  can  kill  Blue  tanks 

P = Blue  aircraft  attrition  rate  per  sortie  flown 
b = rate  at  which  a Blue  artillery  weapon  can  kill  Red  tanks 
k = average  number  of  Red  tanks  killed  per  aircraft  sortie 
s = sortie  rate,  per  available  aircraft 

V = aircraft  availability  rate,  taking  into  account  NORM,  NORS,  etc. 
r = replacement  rate  for  Red  tanks 

L = rate  at  which  a Blue  artillery  weapon  can  kill  Red  artillery 

M = rate  at  which  a Red  artillery  weapon  can  kill  Blue  artillery 

N = average  number  of  Red  artillery  weapons  killed  per  aircraft  sortie 
f = fraction  of  Blue  aircraft  sorties  employed  against  Red  tanks  (the  rest 
are  used  against  artillery) 

g = fraction  of  Blue  artillery  employed  against  Red  tanks  (the  rest  are  used 
against  artillery) 


\\ 


f 


« ; 
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RED  TANKS 

= - JY1  - bgY2  - VskfY 3 + r 
dt 

RED  ARTILLERY 

^2  = _ L(l-g)Y2  - VsN(l-f)Y3 
dt 

BLUE  TANKS 

= - KX, 
dt 

BLUE  ARTILLERY 

fli  - - MX, 
dt 

BLUE  AIRCRAFT 

= - VsPY, 
dt 

From  the  first  differential  equation  in  Figure  1 we  see  that  Red  tanks 
are  killed  by  Blue  tanks  (Y-,),  Blue  artillery  (Yp)  and  Blue  aircraft  (Yg). 

To  some  extent  Red  tank  losses  are  compensated  for  by  replacement  tanks, 
at  a rate  of  r per  minute.  The  Blue  commander  has  two  weapon  allocation 
problems:  allocation  of  available  aircraft  against  tanks  (f)  and  aqainst 

artillery  (1-f);  allocation  of  available  artillery  against  tanks  (g)  and 
against  artillery  (1-g). 

5.  The  solutions  of  this  model  express  the  numbers  of  weapons  of  each  type 
surviving  (and  commicted)  as  functions  of  time.  This  model  can  be  used 

to  investigate  tradeoffs  between  tanks  and  close  air  support  aircraft  in 
the  following  way.  We  set  a tactical  goal  and  calculate  the  various 
combinations  of  "number  of  tanks"  and  "number  of  aircraft" , each  of  which 
will  achieve  the  goal.  An  example  of  a goal  is:  "Reduce  the 
Red  tank  strength  by  100  within  2 hours.  The  tradeoff  curves  (tanks  versus 
aircraft)  will  usually  depend  on  the  values  assumed  by  all  the  other  factors 
in  the  model,  such  as  Red  tank  effectiveness.  Blue  tank  effectiveness.  Blue 
aircraft  attrition  rate,  number  of  Blue  artillery  tubes  available,  etc. 

The  tradeoff  curves  also  vary  with  the  type  of  goal  required.  "Reduce  the 
Red  to  Blue  tank  force  ratio  by  50%  in  6 hours"  would  give  different  curves. 

6.  For  example,  if  we  leave  out  Red  and  Blue  artillery  to  simplify  the 
calculations  and  make  the  following  assumptions 


J = 

.003 

S = .004 

K = 

.001 

P = .05 

V = 0.70 

k = 2 

we  get  the  following  results,  for  the  Blue  goal  of  killing  the  required 
number  of  Red  tanks  within  16-2/3  hours: 


Number  of  Red 

Number  of 

Blue 

tanks  killed 

aircraft 

tanks 

400 

100 

133 

75 

200 

50 

267 

300 

100 

33 

75 

100 

50 

167 

7.  The  tradeoff  between  Blue  tanks  and  Blue  aircraft  depend  on  two  major 
uncertainties:  the  duration  of  combat  and  the  ratio  of  Blue  tank  effectiveness 

(J)  to  Red  tank  effectiveness  (K).  We  see  this  directly  in  the  following 
results,  based  on  the  assumptions:  V = 0.70,  S = 0.004,  k = 2,  p = 0.05. 
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Ratio  of  Blue  tank 
effectiveness  to  Red 
tank  effectiveness 

Combat  Time 
(minutes) 

Number  of  Blue  tanks 
equivalent  to  one 
Blue  aircraft 

1 to  1 

2000 

8.5 

1000 

6.2 

500 

5.8 

3 to  1 

2000 

5.3 

1000 

2.7 

500 

2.2 

8.  For  the  next  decision  problem  (3e)  the  question  is:  How  should  Red 

tank  fire  be  allocated  between  three  or  four  types  of  Blue  anti -armor 
weapons  (targets)?  The  Red  side  makes  tactical  judgments  about  allocation 
of  fire.  Here  we  let  f,  be  the  fraction  of  Red  fire  directed  against  type  1 
Blue  weapons,  f2  the  fraction  of  Red  fire  directed  against  type  2 Blue 
weapons,  etc.  we  could  let  the  f.  factors  vary  with  time  during  the  battle, 
but  in  the  examples  given  here  we^ssume  that  for  a given  battle  each  f. 

is  given  a fixed  value,  with  the  sum  of  f.  equal  to  one. 

9.  The  ability  of  individual  weapons  to  kill  targets  (detect,  hit,  kill) 
is  assumed  to  vary  with  time  during  the  battle  (comparable  to  variations 
with  range  as  intervisibility,  detection  and  weapon  accuracy  change).  The 
model  represents  the  dynamics  of  combat  as  the  battle  progresses,  the  rates 
at  which  the  numbers  of  weapons  surviving  changes  due  to  attrition.  The 
model,  a set  of  N + 1 differential  equations,  is  given  below  in  para  10. 

The  factors  and  variables  are  defined  as  follows: 

X = number  of  Red  tanks 


number  of  type  1 Blue  weapons  (e.g.,  M60A1E3) 

number  of  type  2 Blue  weapons  (e.g.,  TOW  on  Ml  1 3 ) 

number  of  type  3 Blue  weapons  (e.g.,  TOW  on  jeep) 

number  of  type  4 Blue  weapons  (e.g.,  DRAGON) 


Yn=  number  of  type  N Blue  weapons. 

f..=  fraction  of  Red  tank  fire  allocated  against  type  i Blue  weapons 
(This  could  include  target  opportunities  as  well  as  target  priorities.) 

K.  (t)  = average  rate  at  which  a type  i Blue  weapon  can  kill  Red  tanks 

J 

(This  includes  engagement  opportunities,  hit  probability,  rate  of  fire  and  kill 
probability  given  a hit.) 

0.  (t)  = average  rate  at  which  a Red  tank  can  kill  type  i Blue  weapons 
We  assume  that  each  and  J.  is  a linear  function  of  time.  In  particular, 
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10.  The  model  is: 
Red  tanks  * 

dX_ 

dt 

Blue  weapons 


K.(t)  = a.  + b .t 
x x x 

J .(t)  = a . + d.t 
x xx 


K2(t)Y1-K2(t)Y2  -...-KN(t)YN 


dIi 

af'-Vw  fixltl 


dY„ 

Cl 

dt 


= - J2(t)  f2  X(t) 


W = - fB  x(t) 


11.  We  consider  five  alternative  tactical  allocation  schemes  for  the  Red 
tanks,  as  follows: 


a.  Initial  Blue  strength. 


/,  = 


Y.(o) 

x 


X & 

E Y . (0) 
0=1  3 


b.  Equal  priorities  by  target  type. 

f.  = - 
Jx  N 

c,  Initial  threat  to  Red  tanks. 


a . 

f.=JL 

N 


Z a . 
0=1  3 


d.  Initial  ease  of  killing. 


/,  = 


c . 

X 


x N 
Z 

o-i 


Z c . 

0 


l* 
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e.  Later  threat  to  Red  tanks  (time  t,  e.g.,  t = 5 or  10). 

a.  + b . t 

f%  Is  Is 


l (a  . + b . t) 

0=1  3 3 

12.  In  order  to  compare  these  Red  alternatives  we  must  make  assumptions 
about  the  initial  force  sizes  on  both  sides  and  the  coefficients  representing 
weapon  kill  capabilities.  In  a number  of  runs  we  used  the  following  values: 


Blue  weapons 

Blue  versus  Red 

Red  versus  Blue 

a . 

Jl_ 

b. 

% 

a . 

d. 

type  1 (tanks) 

.152 

.163 

.013 

.015 

type  2 (long  range  ATGM) 

.053 

.270 

.008 

.040 

type  3 (short  range  ATGM) 

.000 

.600 

.000 

.060 

V/i th  overall  Red  to  Blue  initial  force 
4 to  1 the  order  of  preference  for  the 

ratios  on  the  order 
Red  alternatives  in 

to  3 
para 

tc 

11 

1 or 
turned 

out  as  fol lows: 


Best:  e. 

a. 

b. 
d. 

Worst:  c. 


later  threat  to  Red  tanks 
initial  Blue  strength 
equal  priorities  by  type 
initial  ease  of  killing 
initial  threat  to  Red  tanks 


13.  In  the  next  decision  problem  (3i)  we  consider  the  tactical  question 
of  how  a given  initial  Blue  force  should  be  broken  up  into  smaller  units 
for  employment  against  the  enemy.  If  the  effectiveness  of  the  defending 
Blue  force  does  not  depend  on  the  absolute  scale  of  the  battles  fought 
by  the  units  (but  only  on  the  force  ratio)  then  it  might  not  matter  how 
the  initial  force  is  broken  up  into  units.  We  have  investigated  many 
alternative  types  of  models  with  respect  to  this  question.  Here  we  present 


results 

for  four 

of  them: 

Model  A 

ii 

- KB(t) 

ft  ‘ - *"*> 

Model  B 

ii 

Model  C 


~ = - LB(t)R(t)  + cLR(t) 
at 

§£  = - JR(t)B(t)  + bB(t) 

Model  D 

— - (p  + qt)B(t) 

^ = - (a  + bt)R(t) 

14.  In  evaluating  combat  of  maneuver  units  the  single  most  meaningful  measure 
of  effectiveness  is  the  cumulative  loss  ratio — i.e.,  the  ratio  of  total  Red 
losses  to  total  Blue  losses.  This  loss  ratio  will  depend  on  many  factors, 
including  (in  most  cases)  the  initial  force  ratio--the  ratio  of  the  initial 
number  of  Red  weapons  to  the  initial  number  of  Blue  weapons.  If  we  cal- 
culate the  cumulative  loss  ratio  at  the  particular  time  t at  which  Blue 
has  a fraction  "A"  of  his  force  surviving  (e.g.,  A = 0.70)  the  result  for 
Model  A is:  ■, 

F * -1  Oll-A h 1 


L(t)  = 


where  F0  is  the  initial  force  ratio  ( and  c = y,  the  ratio  of  individual 
weapon  effectiveness  coefficients.  It  ts  clear  that,  for  Model  A,  L does 
not  depend  on  the  scale  of  the  battle  (B0  or  Rc)  but  only  on  the  initial 
ratio  of  forces. 


15.  For  Model  B,  the  loss  ratio  is: 


Lft>  ■ fir 


N+l 

1 ~ "\  1 


M+l  M-N 
a (1  - A )R„ 

„ M+l 


If  M does  not  equal  N then  L does  depend  on  the  scale  (Rc)  but  if  M = N 
then  it  does  not. 

16.  The  cumulative  loss  ratio  L satisfies  the  following  equation  when 
B(t)  equals  ABC  for  Model  C: 


b log  ( F0  - (1  - A)L]  + J(1  - A)  B0  L = 
b log  F0  + a log  A + K(1  - A)  B0 


Since  Bc  appears  explicitly  the  loss  ratio  does  depend  on  scale  for  Model  C. 
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17.  Based  on  the  Taylor  series  solutions  of  Model  D,  the  cumulative  loss 
ratio,  at  any  time  t,  is: 


L(t)  = 


pt  - 2 (paFr 


t3  2 

i)  + J [P  a -(pb+ZgalFp] 


2 6 2 

aF0t  - t_  (ap  - bF0)  + t_  j-a  pFa  - (aq  + 2bp 
2 6 


This  expression  depends  on  F0  but  not  on  B0  or  R0  (and  hence  not  on  the 
scale  of  battle). 

18.  The  final  decision  problem  to  be  illustrated  in  this  paper  is  3 j : 
with  a given  force  available,  how  much  time  would  be  required  to  reduce 
an  enemy  force  to  a specified  level?  For  example.  Red  has  six  tank  platoons 
and  Blue  has  three  tank  platoons.  How  long  would  it  take  Red  to  reduce 
Blue  to  65%  of  his  initial  strength  (e.g.,  with  about  2 platoons  left)? 

These  numbers  are  too  small  for  stable  results  from  a deterministic  model. 
Thus,  we  consider  the  following  stochastic  model,  developed  by  Isbell  and 
Marlow.  At  time  t,  the  probability  that  exactly  R Red  units  and  exactly  B 
Blue  units  are  surviving  is: 

P(Rj  B ; ROJ  B0,  t) 

where  R0  and  B0  are  the  initial  strengths.  If  f and  g are  transition 
probabilities,  in  small  increments  of  time,  for  Red  and  Blue  respectively, 
then  it  is  assumed  that  the  function  P satisfies  the  following  differential 
equations: 

%(R’B)  = f (R+l,  B)  P(R+1,B)  + g(R,B+l)P(RyB  + 1)  - [/?/?,  B)  + g(R,  BjJ  P 
(R,  B) 

where  P(R , B , t)  = 0 if  R > R0  or  B > Ba 
and  P(R0 , Boy  0)  = 1. 

If  we  assume  that  f and  g are  linear  functions  of  weapon  characteristics 
and  that  Red  and  Blue  weapon  are  equally  capable,  then  the  solutions  are 
as  given  in  Figure  2,  where  the  functions  F satisfy  the  following  relations: 

F(R,  B;  ROJ  B0)  = - - Bb  + - F(R  + 1,  B ; Roy  BJ 


R + B + 1 


F(R,  B + 1;  Roy  B0) 


FIGURE  2 
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19.  Examples  of  specific  solutions  are  the  following; 
a. 


Initial  Red  force; 
Initial  Blue  force; 


Probability 


6 platoons 
3 platoons 


.63 

.56 

.42 

.16 


Time  required  to  reduce  Blue  force 
by  65% 

(t  minutes  or  less) 


b. 


Initial  Red  force:  4 platoons 

Initial  Blue  force:  2 platoons 


Probability 

.42 

.32 

.19 

.06 


Time  required  to  reduce  Blue  force 
by  65% 

(t  minutes  or  less) 

7 

5 

3 

1 


« 1 

.V 

<V 

u 

w 

l4 
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ABSTRACT.  A shooting  method  is  the  superposition  of  initial  value  solutions 
of  ordinary  differential  equations  such  that  boundary  are  "met"  or  a performance 
index  is  minimized. 

The  results  of  meeting  noisy  boundary  conditions  In  least  squares  and  minimax 
norms  are  presented.  The  example  problem  is  a damped,  forced  harmonic  oscillator. 

The  procedures  are  basic  to  system  identification  problems. 

1.  INTRODUCTION.  The  linear  boundary  value  problem  is  governed  by  the 
ordinary  differential  equation 


y ~ ^y  + f (l) 

where  y and  y are,  respectively,  the  vector  of  n state  variables  and  its  deriva- 
tive, L and  / are  matrix  and  vector  functions  of  the  independent  variable  t,  time. 
The  solution  of  this  differential  equation  is  subject  to  a set  of  boundary  condi- 
tions 


q = bi  i = (2) 

where  <?£  is  the  boundary  condition  operator  that  specifies  a linear  combination 
of  the  state  variables  equal  to  the  boundary  value,  b.,  at  time,  t.. 

A shooting  method  is  to  superimpose  appropriately  independent  solutions  of 
(1).  This  can  be  written  as 

y = l p(j)a.  (3) 

3=0  3 

where  p(3)  is  a particular  solution  of  (1)  and  a.  is  the  corresponding  superposi- 
tion constant.  3 
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rj?"%  •**?.- .*  '/y*  : 


J 


(0) 


The  Independence  properties  can  be  assured  by  the  following  strategy.  Assume 
(0)  = a.  Then  take 


(o) 


(O)  = a . + 6 . .6  . 

v 3 


i*,  0  2  3 2|  2, • • • ,fl 


(4) 


where  6 is  the  Kronecker  delta  and  all  e.  / 0.  The  above  strategy  gives  a deter- 

3 

minant  (Wronskian)  of  the  associated  homogeneous  differential  equations,  at  t=0 , 
of  the  product  of  the  e's. 

The  superposition  of  particular  solutions  also  requires  that 


r 

%,\<4 

I . 

ft 


l a-j  = 1- 
o=o  J 


(5) 


Childs  et  al . [1970]  give  more  details  on  the  strategy  and  a proof  of  (5).  If  a 
is  the  initial  value  vector  that  makes  (1)  satisfy  (2;  then  it  is  obvious  that  aQ=l 

and  an=a0=...=a  =0.  How  close  the  actual  superposition  constants  come  to  these 

la  YL 

values  is  an  indication  of  the  merit  of  the  numerical  method. 

The  superposition  (3)  is  substituted  into  the  boundary  conditions  (2).  If  the 
boundary  conditions  are  linear  in  yt  then  the  operators  q and  z may  be  interchanged 
giving 


V.i 


l a.-b. 


0=0 


X-  — 1 ) 2 j • • • jin • 


(6) 


The  use  of  shooting  procedures  results  in  the  particular  solutions,  p J , being 
known  and  we  observe  that  the  bracketed  terms  in  (6)  are  simply  coefficients  of  an 
algebraic  equation  in  the  unknown  superposition  constants.  We  write  (5)  and  (6)  as 
the  matrix  equation 


Sa  = d 


(7) 


where 

and 


S0J  = ij 


do  1 


(j), 


si,i  - 


d.  - b. 


i - 1,2,..  .,m 
0 ~ 0, 1 , . . . . 


(8) 


n 


2.  NONLINEARITIES.  If  the  differential  equation  is  nonlinear  then  we  may 

write  it  as 


y = g(y,t). 


(9) 


Equation  (9)  is  linearized  via  a Taylor  series  expansion  in  order  to  obtain  an 
equation  linear  in  z; 


n 


z = g(w,t)  + 


[$L 


( 8 - W ) 


(10) 


where  w is  a reference  solution  or  a previous  approximation  to  y.  Equation  (10) 
may  be  rewritten  as 
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3 = «7z  + g ' 

where  j = |*2L  and  g'  = g(w,t)  - Jw  . 

Therefore,  if  we  are  given  a nonlinear  differential  equation  subject  to 
boundary  conditions,  we  may  approximate  it  by  the  linear  equation 

z = Jz  + g ' 


subject  to  the  boundary  conditions 

m bi 


i = 7,  2, . . ,,m. 


It  is  obvious  that  (13)  and  (14)  are  analagous  to  (1)  and  (2)  and  so  we  proceed 
in  the  same  fashion.  The  only  difference  is  that  now  the  solution  is  obtained 
iteratively. 

We  again  superimpose  n+i  particular  solutions  of  (13)  that  meet  the  boundary 
conditions  (14): 

I P(i>a.  (15) 


where  each  p {J  satisfies 

i>(o)  = Jp(;i)  +g*. 

The  superposition  constants  are  determined  by 

n 

i = * 

3=0  J 


l (p  J l(t.))a.)  = bi 
1 3=0  v J 


t — 2. j 2 j ••  • jTTl'> Yl  • 


If  the  operators,  q.,  are  linear  then  (17)  and  (18)  form  a set  of  linear  equations 
analagous  to  (7).  1 

If  any  of  the  boundary  condition  operators,  are  nonlinear,  then  they  must 
also  be  linearized  by  a Taylor  series  expansion.  The  linearization  is  done  with 
respect  to  the  superposition  constants  with  the  initial  reference  values  of  the 
vector,  a,  as 


ao~  i 

ar° 


3 7,2,  .. » 3yi  . 


The  reader  can  refer  to  Childs  et  al.  [1970]  and  to  Roberts  and  Shipman  [5] 
for  more  details  on  these  linearization  procedures. 
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3.  OVERDETERMINED  BOUNDARY  CONDITIONS.  If  the  number  of  boundary  condi- 
tions, m,  is  greater  than  the  order  of  the  differential  equation,  n , then  (7) 
constitutes  an  overdetermined  set  of  linear  equations  with  the  a.'s  unknown. 

Not  all  of  the  equations  can  be  met  exactly,  some  will  have  to  be  met  in  a 
"best  fit"  sense.  Let's  assume  that  p of  the  m boundary  conditions  are  to  be 
met  exactly,  then  p +1  of  the  m+l  equations  (the  superposition  condition  is 
included)  must  be  met  exactly.  Equation  (7)  may  be  partitioned  as  follows; 


L« 

1 

-V 

kJ 

The  components  s 2 and  dg  correspond  with  the  equations  to  be  met  exactly. 

By  suitable  matrix  operations,  (20)  can  be  transformed  into 


(21) 

(22) 

(23) 


i 

5 2 

a 

e 

s 

rd'i 

e 

0 

-ac- 

d' 

- oJ 

Two  matrix  equations  results  from  (21): 

S a a = d ' 

4 o o 


a = d' 
e e 


S2  ao 


Equation  (22)  is  solved  in  a "best  fit"  sense  for  a , which  is  then  substituted 

into  (23)  for  a . 0 

& 

Once  the  superposition  constants,  aj  , are  found,  they  are  multiplied  by 

their  appropriate  particular  solutions  at  t=0 , that  is,  which  yields 

an  estimate  of  y(0)t  i.e. 


y(0) 


l P(d)(0)a.  . 
0=0  3 


(24) 


If  the  problem  is  nonlinear,  the  LHS  of  (24)  is  taken  as  the  unperturbed  par- 
ticular solution  at  t=0t  (o) . Independent  perturbed  solutions  are  generated 
by  the  strategy  described  in  (4)  and  a new  set  of  superposition  constants  found. 

The  method  is  repeated  until  convergence  of  consecutive  p(0*  (0)  vectors  are 
observed  (i.e.,  aQ  will  approach  unity  and  all  other  a.'  s will  approach  zero). 

There  are  two  principal  methods  of  solving  overdetermined  systems  of 
linear  equations.  They  are: 


1)  least  squares  solution 

2)  minimax  or  Chebyshev  solution. 

a.  Least-Squares  Solution:  Given 


s'  a = d ' 

4 o o 
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(25) 


• 


* 


the  residual  vector  can  be  written, 

R - s'  a - d ' . 

4 o o 


(26) 


The  least-squares  solution  is  the  vector,  that  minimizes  the  sum  of  the 
squares  of  the  components  of  the  residual  vector,  R,  is: 

= (S4)Ts4~1(S4)Td,Q  • (27) 

This  is  substituted  back  into  (23)  to  find  a&  . 


b.  Minimax  Solution:  The  minimax  solution  is  the  vector  aQ  which  minimizes 
the  largest  absolute  value  of  the  components  of  the  residual  vector  (26).  That 
is,  we  want  to  minimize  max (r^r^,..., rm_^)  • T^e  met^od  advanced  by  Powell  is 

used  in  the  program.  See  [3]  and  [4]  for  more  specifics  on  the  minimax  method. 
4.  RESULTS.  We  considered  the  following  problem 

x + px  + Z,x  = sin(t)  (28) 


which  is  the  equation  of  motion  for  a forced,  damped  harmonic  oscillator. 
By  the  change  of  variables 

y2=x»  y 2 = x*  ^3  = Vj  y4  = £ 

(28)  may  be  replaced  by 

y2  = y2 

y2  = -y4y:  -y3y2  + sin(t) 
y4  * o . 


(29) 


(30) 


Initial  values  were  selected  for  the  state  variables  and  solutions  for 
y and  y were  generated  on  the  interval  0<t<l5.  At  times,  t=lt2,3t .. .,15,  the 
value  of  y2  was  observed.  These  values  were  taken  as  the  exact  boundary  values 
(to  8 significant  figures).  Six  sets  of  "noisy"  data  were  produced  by  two 
techniques.  The  first  was  to  round  off  the  exact  boundary  values  to  1,2,  and 
3 decimal  places  to  the  right  of  the  decimal  point.  The  second  technique  was 
to  add  a random  variable  that  was  normally  distributed  with  a mean  of  zero  and 
a standard  deviation,  a.  Three  different  values  of  a were  used:  .01,  .1,  and 
.5.  See  Table  1 for  the  sets  of  boundary  values.  The  program  was  run  using 
each  set  of  the  corrupted  boundary  values  as  data.  The  errors  between  the 
originally  selected  initial  conditions  and  those  that  the  computer  estimated 
from  the  noisy  data  were  computed.  Both  least-squares  and  minimax  were  employed 
to  solve  the  overdetermined  system,  for  each  data  set.  See  Tables  2,  3,  4 
and  5.  Two  criteria  were  chosen  as  the  basis  for  evaluating  the  "closeness" 
of  fit:  (1)  the  sum  of  the  absolute  values  of  the  errors  and  (2)  the  sum  of  the 
squares  of  the  errors.  In  all  cases,  the  least-squares  solution  proved  the 
better  fit,  as  expected.  The  accuracy  of  the  parameter  estimates  is  impressive, 
even  with  the  noisiest  data. 
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Boundary  values 

Boundary  values 

Boundary  values 

rounded  to  3 

rounded  to  2 

rounded  to  1 

» ti 

decimal  places 

decimal  places 

decimal  place 

1 

-0.220 

-0.22 

-0.2 

2 

0.35D-01 

0.3D-01 

0.0 

3 

-0.474 

-0.47 

-0.5 

4 

-0.589 

-0.59 

-0.6 

5 

0.393 

0.39 

0.4 

6 

1.597 

1.60 

1.6 

P 7 

1.452 

1.45 

1.5 

8 

-0.388 

-0.39 

-0.4 

9 

-2.324 

-2.32 

-2.3 

10 

-2.274 

-2.27 

-2.3 

11 

0.88D-01 

0.9D-01 

0.1D-00 

12 

2.711 

2.71 

2.7 

13 

2.997 

3.00 

3.0 

14 

0.401 

0.40 

0.4 

15 

-2.816 

-2.82 

-2.8 

i 


1 ' 


i 


i 


ft  • 


I 

£ 

« 

V 

A1 


Boundary  values  Boundary  values  Boundary  values 
with  N(0, .01)  with  N(0,.l)  with  N(0,.5) 

r.v.  added  r.v.  added  r.v.  added 


1 

-0.21869550 

-0.20667165 

2 

0.37798268D-01 

0.66064361D 

3 

-0.46666120 

-0.40497946 

4 

-0.59655859 

-0.66370844 

5 

0.39299204 

0.39030898 

6 

1.5960593 

1.5846639 

7 

1.4712304 

1.6443249 

8 

-0.38313658 

-0.33635166 

9 

-2.3169182 

-2.2529945 

10 

-2.2631763 

-2.1656611 

11 

0.91826552D-01 

0.12353162 

12 

2.6876393 

2.4736753 

13 

3.0007328 

3.0323136 

14 

0.39058453 

0.29953824 

15 

-2.8133056 

-2.7851188 

-0.15323232 

0.19169151 

-0.13083809 

-0.96215247 

0.37838423 

1.5340173 

2.4136342 

-0.12841859 

-1.9688892 

-1.7322599 

0.26444312 

1.5227239 

3.1726727 

-0.10511233 

-2.6598437 


TABLE  1 


:#T2*sr 


True  value 
of  Initial 
conditions 

I.C.  estimates  using 
Least  Squares.  B.V.’s 
input  with  8 signifi- 
cant figures 

I.C.  estimates 
using  Minimax.  B.V.'s 
Input  with  8 significant 
figures 

X(0) 

1.0 

0.99999999 

0.99999997 

Uo) 

0.5 

0.49999999 

0.49999998 

u 

0.2 

0.20000000 

0.20000000 

K 

1.0 

1.00000000 

1.00000000 

y 


CO 

o 

1 

o 

o 

• 

C\J 

II 

0) 

II 

-P* 

• 

O 

0 

1 

o 

CD 

l e\  = 2.0D-16 

I e\  = 1.0D-15 

. i 

vj 


fi 

-- 

itt 


it* 

;) 


TABLE  2 
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I.  C.  estimates  using 
Least  Squares.  B.V.'s 
are  rounded  to  3 deci- 
mal places 


I.  C.  estimates  using 
Minimax.  B.V.'s  are 
rounded  to  3 decimal 
places 


1.0000524 


0.99976973 


0.49978310 


0.49976451 


0.20004417 


0.20009774 


0.99996730 


0.99998927 


l\e. | = 3.4617D-04 


l\e.\  = 5.7423D-04 


l e.  = 5.2812D-08 

L x 


y e.  =11.81480-08 

L x 


I.  C.  estimates  using 
Least  Squares.  B.V. 's 
are  rounded  to  2 deci- 
mal places 


I.  C.  estimates  using 
Minimax.  B.V. 's  are 
rounded  to  2 decimal 
places 


1.0011527 


1.0008828 


0.50086499 


0.50155621 


0.19989372 


0.19996940 


1.0001414 


0.99988326 


£|e.|  = 2.2654D-03 

7/ 


l\e.\  = 2.5864D-03 


y = 2. 1082D-06 

L x 


y e.  = 3.2157D-06 
L x 


TABLE  3 


X,  >'■ 


t 


X 


I.C.  estimates  using 
Least  Squares.  B.V.'s 
are  rounded  to  1 deci- 
mal place 


I.C.  estimates  using 
Minimax.  B.V.'s  are 
rounded  to  1 decimal 
place 


X(0) 

0.95849170 

0.93474852 

X(0) 

0.50061069 

0.48097522 

u 

0.20243731 

0.20584127 

5 

0.99890736 

0.99848235 

\\ei\  = 4.5649D-02 

l\e.\  = 9.1635D-02 

l e2.  = 1.7304D-03 

L t 

l e2  = 4.6561D-03 

L t 

I.C.  estimates  using 
Least  Squares.  N(0,.01) 
r.v.  added  to  the 
B.V.'s 

I.C.  estimates  using 
Minimax.  N(0,.01)  r.v 
added  to  the  B.V.'s 

X(0) 

1.0009066 

0.99134015 

X(O) 

0.49193995 

0.48700959 

V 

0.20106603 

0.20179481 

5 

1.0003406 

0.99973471 

l\e.\  = 9.7755D-03 

l\ei\  = 23.7104D-03 

l e2  = 6.7049D-05 

L ^ 

V e2.  = 24.7035D-05 

L 'l 

TABLE  4 


» 

ii 


A 


r. 

I.C.  estimates  using 
Least  Squares.  N(0,.l) 
r.v.  added  to  the 
B.V.'s 

I.C.  estimates  using 
Minimax.  N(0,.l)  r.\ 
added  to  the  B.V.'s 

X(0) 

1.0112150 

0.91127941 

X(0) 

0.41910446 

0.36854682 

! ' 

V 

0.210717C 5 

0.21777510 

[' 

5 

1.0033087 

0.99682169 

l\*i\  " -1061 

[1^1  = .2411 

• 

I < . 

y = 6.7957D-03 

y e2.  = 2.5477D-02 

L ^ 

I.C.  estimates  using 
Least  Squares.  N (0 , . 5 ) 
r.v.  added  to  the 
B.V.'s 

I.C.  estimates  using 
Minimax.  N (0, .5)  r. 
added  to  the  B.V.'s 

X(0) 

1.1020264 

0.49280864 

1 , 
; ; 

X(0) 

0.90461984D-01 

-0.18505768 

V 

0.25470622 

0.28377993 

c 

1.0143196 

0.97314194 

. £ 
i *■ 

' 

l\*i\  = -5809 

l\e.\  = 1.3029 

\ 

i* 

r 

1 * 

fc 

1 

1 

* 

y e2.  = .1813 

y e?  = .7343 

U t 

i \ 

TABLE  5 
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ABSTRACT.  An  extension  of  regression  analysis  from  the 
usual  algebraic  models  to  differential  equation  models  is  given. 

A shooting  method,  superposition  of  appropriately  independent 
initial  value  solutions  of  differential  equations,  is  used.  The 
shooting  method  used  is  based  on  particular  solutions  of  the  gov- 
erning differential  equations.  Nonlinear  differential  equations 
and/or  boundary  conditions  can  be  accommodated. 

The  statistics  of  linear  regression  are  generated  through 
a straightforward  analysis  of  variance.  These  provide  the  basis 
of  "acceptance"  or  "rejection"  of  the  regression. 

The  statistics  generated  include  an  (uncorrected)  ANOVA 
tables,  general  F-test  on  the  regression,  R^  value,  the  coefficient 
of  variation,  covariance  matrix  of  the  superposition  constants, 
estimate  of  the  variance  about  the  regression,  estimate  of  the 
variance  of  the  parameters,  and  the  confidence  intervals  of  these 
estimates. 


1.  INTRODUCTION.  The  linear  boundary  value  problem  is  governed  by  the 


ordinary  differential  equation 

y = Ly  + f (1) 

where  y and  y are,  respectively,  the  vector  of  n state  variables  and  its  deriva- 
tive, L and  f are  matrix  and  vector  functions  of  the  independent  variable  t, 
time.  The  solution  of  this  differential  equation  is  subject  to  a set  of  boundary 
conditions 

q .(y(t .) ) = b . 1=1,2,...,  m>n  (2) 

where  is  the  boundary  condition  operator  that  specifies  a linear  combination 
of  the  state  variables  equal  to  the  boundary  value,  b-,  at  time,  t. . We 

fc-  Is 

are  concerned  only  with  those  cases  where  m>n  and  the  boundary  conditions  are 
to  be  met  in  a least  squares  sense. 

A shooting  method  is  to  superimpose  appropriately  independent  solutions  of 
equation  (1).  This  can  be  written  as 

y - l P(ij)a.  (3) 

0=0  3 


where  p js  a particular  solution  of  (1)  and  a.  is  the  corresponding  super- 

0 

imposition  constant. 

The  independence  properties  can  be  assured  by  the  following  strategy. 
Assume  (0)  = a.  Then  take 


^ (0)  ~ ai  + ^ij^j  3 - 1»  2,...,  n 

where  & is  the  Kronecker  delta  and  all  3 . £ 0.  The  above  strategy  gives  a 

V 

determinant  (Wronskian)  of  the  associated  homogeneous  differential  equations, 
at  t=0 , of  the  product  of  the  3's. 


(4) 
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The  superposition  of  particular  solutions  also  requires  that 


n 

l a,  = 1.  ( 

3=0  3 

Childs  et  al.  [1970]  give  more  details  on  the  strategy  and  proof  of  (5). 

If  a is  the  initial  value  vector  that  makes  (1)  satisfy  (2)  then  it  is  obvious 
that  cLq  = 1 and  a ^ = a ^ =...=  a n = 0.  How  close  the  actual  superposition 
constants  come  to  these  values  is  an  indication  of  the  merit  of  the  numerical 
method. 

The  superposition  (3)  is  substituted  into  the  boundary  conditions  (2). 

If  the  boundary  conditions  are  linear  in  y,  then  the  operators  q and  £ may  be 
interchanged  giving 


l 'JJP'’  <V;  Un  = bi 

i=n  x i 3 v 


i = ly  2y...  ,ra. 


The  use  of  shooting  procedures  results  in  the  particular  solutions,  p 3 , being 
known  and  we  observe  that  the  bracketed  terms  in  (6)  are  simply  coefficients 
of  an  algebraic  equation  in  the  unknown  superposition  constants.  We  write 
(5)  and  [6)  as  the  matrix  equation 


where 


Sa  = d 

S0J  - 3-  do  - 1 


and  S.  . = q . (p  3 J (t .) ) y d.  = b.  i = 1,  2,.,.,  m 

lj3  t 1 X 'L  3 = 0y  ly...y  n. 

2.  THE  OVER-DETERMINED  SYSTEM.  The  solution  of  the  system  (7)  is 
easily  obtained  for  the  data  sets  we  have  considered.  We  rewrite  (7)  parti- 
tioning (and  rearranging  if  necessary)  the  elements  of  the  vectors  and 
matrices 


The  equality  sign  is  understood  to  mean  "equality"  (as  much  as  our  numerical 
procedures  allow)  for  the  upper  portion  of  (8)  and  "least  squares"  fit  for 
the  lower  portion.  The  equality  conditions  come  from  the  superposition 
constraint  (5)  and  any  boundary  conditions  that  may  exist  which  should  be  met 
"exactly."  Weighting  of  the  rest  of  the  boundary  conditions  can  be  done 
but  is  not  shown. 

A straightforward  method  of  solution  is  by  elementary  operations 
(Gaussian  reduction  with  maximum  pivot  selection)  to  transform  (8)  into  the 
equality  (9)  and  the  "least  square"  fit  (10) 


+ 'S2al  ~ 'de 

(9) 

>S4al  = 'dl 

(10) 

The  '()  denotes  the  values  have  been  affected  by  the  reduction  process.  Note 
that  'Sj  = I and  ' S ^ = 0. 

The  least  square  solution  for  a in  (10)  is  obtained  in  the  usual  manner, 

the  normal  equations  result  from  premultiplying  by  the  transpose  of  ' S The 

result  is  substituted  into  (9)  to  obtain  the  rest  of  the  a vector. 

The  "correct"  initial  value  vector  can  be  calculated  from 

y(0)  « p(0)  (0)  + Ba  (11) 

where  B is  a diagonal  matrix  with  indices  varying  from  0 through  n.  B = 0} 

B..  = 8.  for  i = n.  Our  computational  procedure  is  to  repeat  the 

process  with  (0)  = y(0 ) from  (11)  such  that  we  should  have 

a.  = 0 j = lj  2, ...  j n (12) 

<7 

This  will  aid  in  construction  of  confidence  limits  of  parameter  estimates. 

This  also  gives  a convenient  quantity 

ii-ii<?(0><*i>>  -h-  H (13) 

which  is  the  "predicted"  boundary  value. 
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3.  ANALYSIS  OF  VARIANCE.  An  uncorrected  ANOVA  table  is  presented  below 
in  terms  of  the  nomenclature  introduced.  The  significant  calculations  are 
presented  in  terms  of  vector  products.  These  products  are  over  the  q and 
b vectors  and  are  formed  over  the  elements  associated  with  least  squares 
boundary  conditions  only , any  exact  boundary  conditions  are  ignored  in  these 
products. 


TABLE  1 


ANOVA  TABLE 


Source 

Sum  of 
Squares 

Degrees  of 
Freedom 

Mean 

Square 

Due  to  regression 

-T\ 

q b 

, T,  -T, 

n 

SS/n 

2 

About  the  regression 
(residual) 

b b - q b 

m-k-n 

s 

Total  (uncorrected) 

T 
b b 

m-k 

Notice  in  the  degrees  of  freedom  column  that  m is  the  total  number  of  boundary 
conditions  and  k of  those  are  to  be  met  exactly. 

We  define  b to  be  the  mean  of  the  least  square  boundary  values 

b = (l  b .)/ (m-k) 


(14) 


The  following  formulae  are  used  to  calculate  the  usual  statistics: 


R2  - 1 ; >1 


(15) 


bTb  - "h2  (m-k) 


s = MS  (residual)  = estimated  variance  of  system 

2 


F cal  = (regression) /s 


(16) 

(17) 


501 


I 


j 


In  (15)  the  summation  and  product  are  over  the  least  square  boundary  conditions. 
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The  F , value  must  exceed  a Fischer's  F with: 
cal 

Probability  of  1 - a (a  is  the  producers  risk) 

Numerator  degrees  of  freedom  n 
Denominator  degrees  of  freedom  m-k-r 
for  the  regression  to  be  accepted. 

The  estimated  variances  of  the  boundary  values  are: 

est.  var.  (q£)  = CS H)  [ ( 'SJ  ( 'S^  ] ~2  ('SH)T  82 
where  the  i subscript  denotes  the  ith  row  of  the  ' S ^ matrix.  The  resulting 
confidence  limits  in  terms  of  the  t statistic  are: 
q . + t( y , 1 - cl/2J  \/[e8t.  var  (q.)] 
where  y = m-k-n  and  a is  the  producer's  risk. 

We  have  stated  these  procedures  are  basic  to  system  identification 
procedures.  We  are  most  interested  in  y(0)  and  its  covariance  in  those  cases. 
Recall  equations  (9)  and  (10)  and  we  denote 
r2  = Cd^Cd^/fm-n) 

The  covariance  matrix  of  a ^ is 

ci  = ics^es^v2 

Likewise,  the  covariance  matrix  of  a is 

e 

Ce  . rs2>  rs/ 

The  final  covariance  matrix  is  formed  by  appropriate  multiplying  by  the 
perturbation  matrix  B giving 


a 


The  zero th  row  and  column  of  the  result  are  null  reflecting  the  variance  of 
the  superposition  constraint  (5).  The  ith  diagonal  element  is  the  estimated 


variance  of  yAO).  Its  square  root  is  the  estimated  standard  deviation  of 
y.(0).  The  confidence  limits  (which  Draper  and  Smith  point  out  should  be 
viewed  with  caution)  are 

y .(0)  +_  t(m-n , l-a/2)  [est.  std.  dev.]  . 

"V  Is 

A more  stringent  confidence  limit  would  be  a hyperellipsoid  like 
eT (C  ) ^ e < n r^  F(n.  m-n . 1- a) 

y 

where  the  vector  e is  within  the  confidence  limits  about  y(0).  This  kind  of 
statement  is  difficult  to  use  if  n>2. 


4.  AN  EXAMPLE.  Consider  the  problem  of  determining  the  coasting  dynamics 
of  an  automobile.  The  three  force  elements  of  a model  of  the  phenomenon  are  [5] 

a.  Rolling  friction  due  to  tire  flexing  and  Coulomb  friction  on 
gear  train. 

b.  Aerodynamic  resistance  proportional  to  the  square  of  velocity. 

c.  Product  of  mass  and  deceleration. 

The  differential  equation  may  be  written  as 

x + C^AfP  + Vj,  g = o (2i 

M2  7 

where 


p = air  density  (slugs/ft  ) 

2 

A^.  = frontal  area  of  vehicle  (ft  ) 

M = mass  of  vehicle  (slugs) 

2 

g = acceleration  of  gravity  (ft/sec  ) 

C = coefficient  of  drag 

Vy  = rolling  friction  coefficient 
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x = velocity  (ft/sec) 

••  2 
x = acceleration  (ft/sec  ) 

Since  the  displacement,  x,  does  not  appear  in  (26),  we  can  write  this  as  a 

single  first  order  ordinary  differential  equation  (substituting  2/  for  x) 

. 2 

y 7 = - V/P  yl  + yJ3  (27) 

1 M2 

The  most  economical  procedure  to  obtain  sufficient  measurements  would  be  to 

coast  an  automobile  in  neutral  from  some  speed  like  say  80  miles/hour  and 

record  the  speed  at  intervals  of  say  5 seconds.  The  boundary  values  in 

Figure  1 are  velocities  in  ft/sec  at  5 second  intervals.  Since  (27)  is 

nonlinear,  the  usual  Newton  type  linearization  procedures  are  employed,  see 

Walker  [4]  in  this  proceedings  or  Childs  [1] , [2]  for  more  details. 

Figure  2 is  the  output  of  our  program  which  is  based  on  (27),  the  data  in 

Figure  1,  parameter  values  common  in  engineering  use,  and  automobile  parameters 

from  the  Road  and  Track  Test  Annual  for  1966  for  a Sunbeam  Alpine.  The 

coefficient  of  drag  C^=  y % = 0.5025  + 0.0690  and  coefficient  of  rolling  friction 

= y7  = 0.0169  + 0.0031  resulted. 
jo 

5.  CONCLUSIONS.  Regression  analysis  with  differential  equation  models 
is  feasible.  It  could  significantly  affect  design  of  experiments  when  such 
models  are  relevant.  Determination  of  the  parameters  in  the  simple  example 
would  be  expensive  if  one  had  to  use  wind  tunnels  or  treadmills. 
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MULTI-LEVEL  ADAPTIVE  SOLUTIONS  TO 
BOUNDARY- VALUE  PROBLEMS* 


Achi  Brandt 

Weizmann  Institute  of  Science,  Rehovot,  Israel 
IBM  Thomas  J . Watson  Research  Center , Yorktown  Heights , New  York 
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ABSTRACT . The  boundary-value  problem  is  discretized  on  several  grids 
(or  finite-element  spaces)  of  widely  different  mesh  sizes.  Interaction* 
between  these  levels  enable  us  (i)  to  solve  the  possibly  nonlinear  system 
of  n discrete  equations  in  0(n)  operations  (40n  additions  and  shifts  for 
Poisson  problems) . (ii)  To  conveniently  adapt  the  discretization  (the 
local  mesh  size,  local  order  of  approximation,  etc.)  to  the  evolving 
solution  in  a nearly  optimal  way,  obtaining  "“-order"  approximations 
and  low  n,  even  when  singularities  are  present.  General  theoretical  analysis 
of  the  numerical  process.  Numerical  experiments  with  linear  and  nonlinear, 
elliptic  and  mixed-type  (transonic  flow)  problems  - confirm  theoretical 
predictions . 
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1,  INTRODUCTION. 

In  most  numerical  procedures  for  solving  partial  differential  equations, 
the  analyst  first  discretizes  the  problem,  choosing  approximating  algebraic 
equations  on  a finite  dimensional  approximation  space,  and  then  devises  a 
numerical  process  to  (nearly)  solve  this  huge  system  of  discrete  equations. 
Usually,  no  real  interplay  is  allowed  between  discretization  and  solution 
processes.  This  results  in  enormous  waste:  The  discretization  process, 

being  unable  to  predict  the  proper  resolution  and  the  proper  order  of 
approximation  at  each  location,  produces  a mesh  which  is  too  fine.  The 
algebraic  system  thus  becomes  unnecessarily  large  in  size,  while  accuracy 
usually  remains  rather  low,  since  local  smoothness  of  the  solution  is  not 
being  properly  exploited.  On  the  other  hand,  the  solution  process  fails 
to  take  advantage  of  the  fact  that  the  algebraic  system  to  be  solved  does 
not  stand  by  itself,  but  is  actually  an  approximation  to  continuous  equations, 
and  therefore  can  itself  be  approximated  by  other  (much  simpler)  algebraic 
systems . 

The  purpose  of  the  work  reported  here  is  to  study  how  to  intermix 
discretization  and  solution  processes,  thereby  making  both  of  them 
orders-of-magnitude  more  effective.  The  method  to  be  proposed  is  not 
"saturated",  that  is,  accuracy  grows  indefinitely  as  computations  pro- 
ceed. The  rate  of  convergence  (overall  error  E as  function  of  compu-  . 
tational  work  W)  is  in  principle  of  "infinite  order",  e.g. , E n,  exp  !-BTi) 
for  a d-dimensional  problem  which  has  a solution  with  scale-ratios  ^j3>0; 
or  E exp  (-W  ' ) , for  problems  with  arbitrary  thin  layers  (see  Sec.  9). 
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The  basic  idea  of  the  Multi  Level  Adaptive  Techniques  (MLAT)  is  to 
work  not  with  a single  grid,  but  with  sequence  of  grids  ("levels")  of 
increasing  fineness , which  may  be  introduced  and  changed  in  the  process , and 
which  constantly  interact  with  each  other.  For  description  purposes,  it 
is  convenient  to  regard  this  technique  as  composed  of  two  main  concepts : 

(1)  The  Multi  Grid  (MG)  method  for  solving  discrete  equations.  This 
method  iteratively  solves  a system  of  discrete  (finite-difference  or  finite- 
element)  equations  on  a given  grid,  by  constant  interactions  with  a hier- 
archy of  coarser  grids,  taking  advantage  of  the  relation  between  different 
discretizations  of  the  same  continuous  problem.  This  method  can  be 
viewed  in  two  complimentary  ways : One  is  to  view  the  coarser  grids  as 

correction  grids , accelerating  convergence  of  a relaxation  scheme  on  the 
finest  grid  by  efficiently  liquidating  smooth  error  components. 

(See  general  description  in  Sec.  2 and  algorithm  in  Sec.  4.)  Another  point 
of  view  is  to  regard  finer  grids  as  the  correction  grids,  improving  ac- 
curacy on  coarser  grids  by  correcting  their  forcing  terms.  The  latter 
is  a very  useful  point  of  view,  making  it  possible  to  manipulate  ac- 
curate solutions  on  coarser  grids,  with  only  infrequent  "visits"  to 
pieces  of  finer  levels.  (This  is  the  basis  for  the  multi-grid  treatment 
of  non-uniform  grids;  cf.  Secs.  7.2  and  7.5.  The  FAS  mode  for  nonlinear 
problems  and  the  adaptive  procedures  stem  from  this  viewpoint.)  The  two 
seemingly  different  approaches  actually  amount  to  the  same  algorithm  (in 
the  simple  case  of  "coextensive"  levels) . 

The  multi-grid  process  is  very  efficient:  A discrete  system  of  n 

equations  (n  points  in  the  finest  grid)  is  solved,  to  the  desired  accuracy, 
in  0(n)  computer  operations.  If  P parallel  processors  are  available, 
the  required  number  of  computer  steps  is  0(n/P  + log  n) . For  example,  only 
40n  additions  and  shifts  are  required  for  solving  the  5-point  Poisson 
equation  on  a grid  with  n points  (see  Sec.  6.3).  This  efficiency  does 
not  depend  on  the  shape  of  the  domain,  the  form  of  the  boundary  conditions, 
or  the  mesh-size,  and  is  not  sensitive  to  choice  of  parameters.  The  memory 
area  required  is  essentially  only  the  minimal  one,  that  is,  the  storage  of 
the  problem  and  the  solution.  In  fact,  if  the  amount  of  numerical  data  is 
small  and  only  few  functionals  of  the  solution  are  wanted,  the  required 
memory  is  only  O(log  n) , with  no  need  for  external  memory  (see  Sec.  7.5). 

Multi-grid  algorithms  are  not  difficult  to  program,  if  the  various 
grids  are  suitably  organized.  We  give  an  example  (Appendix  B)  of  a 
FORTRAN  program,  showing  the  typical  structure,  together  with  its  computer 
output,  showing  the  typical  efficiency.  With  such  an  approach,  the  program- 
ming of  any  new  multi-grid  problem  is  basically  reduced  to  the  programming 
of  a usual  relaxation  routine.  The  same  is  true  for  nonlinear  problems 
where  no  linearization  is  needed,  due  to  the  FAS  (Full  Approximation  Storage) 
method  introduced  in  Sec.  5. 

Multi-grid  solution  times  can  be  predicted  in  advance,  - a 
recipe  is  given  and  compared  with  numerical  tests  (Sec.  6).  The  basic 
tool  is  the  local  mode  (Fourier)  analysis,  applied  to  the  locally 
linearized-f reezed  difference  equations,  ignoring  far  boundaries.  Such 
an  analysis  yields  a very  good  approximation  to  the  behavior  of  the 
high-frequency  error  modes,  which  are  exactly  the  only  significant  modes 
in  the  multi-grid  process,  since  the  low-frequency  error  modes  are 
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liquidated  in  the  coarse-grids  processing,  with  negligible  amounts  of  com- 
putational work.  Thus,  mode  analysis  gives  a very  realistic  prediction  of 
convergence  rates  per  unit  work.  (For  model  problems,  the  analysis  can  be 
made  rigorous;  see  Appendix  C.)  The  mode  analysis  can,  therefore,  be 
used  to  choose  suitable  relaxation  schemes  (Sec.  3)  and  suitable  criteria 
for  switching  and  interpolating  between  the  grids  (Appendix  A) . Our 
numerical  tests  ranged  from  simple  elliptic  problems  to  non-linear  mixed- 
type  (transonic  flow)  problems,  which  included  hyperbolic  regions  and 
discontinuities  (shocks).  The  results  show  that,  as  predicted  by  the 
mode  analysis,  errors  are  reduced  by  an  order  of  magnitude  (factor  10) 
expending  computational  work  equivalent  to  4 to  5 relaxation  sweeps  on 
the  finest  grid. 

(2)  Adaptive  discretization.  Mesh-sizes,  orders  of  approximation 
and  other  discretization  parameters  are  treated  as  spatial  variables. 

Using  certain  general  internal  criteria,  these  variables  are  controlled 
in  a sub-optimal  way,  adapting  themselves  to  the  computed  solution.  The 
criteria  are  devised  to  obtain  maximum  overall  accuracy  for  a given  amount 
of  calculations;  or,  equivalently,  minimum  of  calculations  for  given 
accuracy.  (In  practice  only  near-optimality  should  of  course  be  attenuated, 
otherwise  the  required  control  would  become  more  costly  than  the  actual 
computations.  See  Sec.  8.)  The  resulting  discretization  will  automatically 
resolve  thin  layers  (when  required) , refine  meshes  near  singlular  points 
— «^fchat  otherwise  may  "contaminate"  the  whole  solution) , exploit  local 
smoothness  of  solutions  (in  proper  scale) , etc.  (see  Sec.  9) . 

Multi-grid  processing  and  adaptive  discretization  can  be  used  in- 
dependently of  each  other,  but  their  combination  is  very  fruitful:  MG 

is  the  only  fast  (and  convenient)  method  to  solve  discrete  equations  on 
the  non-uniform  grids  typically  produced  by  the  adaptive  procedure.  Its 
iterative  character  fits  well  into  the  adaptive  process.  The  two  ideas 
use  and  relate  similar  concepts,  similar  data  structure,  etc.  In  particular, 
an  efficient  and  very  flexible  way  to  construct  any  adaptive  grid  is  as  a 
sequence  of  uniform  subgrids,  the  same  sequence  used  in  the  multi-grid 
process,  where  finer  levels  may  be  confined  to  increasingly  narrower  sub- 
domains  to  produce  the  desired  local  refinement.  In  this  structure,  the 
difference  equations  can  be  defined  separately  on  each  of  the  uniform  sub- 
grids, which  interact  with  each  other  through  the  multi-grid  process. 

Thus,  difference  equations  should  only  be  constructed  on  equidistant 
points,  which  makes  it  easy  to  employ  high  and  adaptive  orders  of  ap- 
proximation. Moreover,  the  finer,  localized  subgrids  may  be  defined  in 
terms  of  suitable  local  coordinates,  facilitating,  for  example,  the  use 
of  high-order  approximation  near  a certain  piece  of  the  boundary,  with 
all  these  pieces  naturally  patched  together  by  the  multi-grid  process 
(Sec.  7) . 

The  presentation  in  this  article  is  mainly  in  terms  of  finite- 
difference  solutions  to  partial-differential  boundary-value  problems. 

The  basic  ideas ,, however , are  more  general,  applicable  to  integro-dif- 
ferential  problems,  functional  minimization  problems,  etc.,  and  to 
finite-elements  discretization.  The  latter  is  briefly  discussed  in 
Secs.  A. 5 and  7.3. 
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2.  MULTI-GRID  PHILOSOPHY. 

0 1 M 

Suppose  we  have  a set  of  grids  G ,G  . ..,G  , all  approximating  the 
same  domain  ft  with  corresponding  meshsizes  hQ>h^>...>h  . For  s^n^P^-^c^ty 
one  can  think  of  the  familiar  uniform  square  grids,  with  the  mesh-size 
ratio  h^+^:h^*l:2.  Suppose  further  that  a differential  problem  of  the 
form 


(2.1)  LU(x)  = F(x)  in  ft,  AU(x)  = 4>(x)  on  the  boundary  3ft, 

k 

is  given.  On  each  grid  G this  problem  can  be  approximated  by  difference 
equations  of  the  form 

(2.2)  L^U*  (x)  = F*(x)  for  xeG*,  ANjk(x)  = ^(x)  for  xe3G^. 

(See  example  in  Sec.  3.1).  Wj;  are  interested  in  solving  this  discrete 
problem  on  the  finest  grid,  G . The  main  idea^is  to  exploit  the  fact 
that  the  discrete  problem  on  a coarser  grid,  G say,  approximates  the 
same  differential  problem  and  hence  can  be  used  as  a certain  approximation 
to  the  G problem.  A simple  use  of  this  fact  has  long  been  made  by 
various  authors  (e.g.,  (14));  namely,  they  first  solved  (approximately) 
the  G problem,  which  involves  an  algebraic  system  much  smaller  and  thus 
much  easier  to  solve  than  the  given  G problem,  and  then  they  iterpolated 
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their  solution  from  G to  G , using  the  result  as  a first  approximation 
in  some  iterative  process  for  solving  the  G problem.  A more  advanced 
technique  was^to  use  a still  coarser  grid  in  a similar  manner  when 
solving  the  G problem,  and  so  on.  The  next  natural  step^is  to  ask 
whether  we  can  exploit  the  proximity  between  the  G*  an^  G problems 
not  only  in  generating  a good  first  approximation  on  G , but  also  in  the 
process  of  improving  the  first  approximation. 

M M 

More  specifically  let  u be  an  approximate  solution  of  the  G problem, 

and  let 


(2.3) 


M 

L u 


= F - f 


.M  M 
A u 


.M 

= 4>  -( 


M 


The  discrepancies  fM  and  4>M 


are  called  the  residual  functions,  or  residuals. 

5 for  the  non- 
where  the 


Assuming  for  simplicity  that  L and  A are  lineag  (gf.  Sec. 
linear  case) . the  exact  discrete  solution  is  u =u  +v  , wh 
correction  v satisfies  the  residual  equations 

(2.4)  LM  V**  = A A»  V*  = *M. 


Can  we  solve  this  equation,  to  a good  first  approximation,  again  by  inter- 
polation from  solutions  on  coarsgr  grids?  As  it  is,  the  answer  is 
generally  negative. k Not  every  G -problem  has  meaningful  approximation 
on  a coarser^grid  G . For  instance,  if  the  right-hand-side  f fluctuates 
rapidly  on  G , with  wavelength  less  than  these  fluctuation^  are  not 

visible  on  coarser  grids.  Such  rapidly- fluctuating  residuals  f are  ex- 
actly what  we  get  when  the  approximation  u has  itself  been  obtained  as 
an  interpolation  from  a coarser-grid  solution. 

An  effective  way  to  damp  rapid  fluctuations  in  residuals  is  by  usual 
relaxation  procedures,  e.g.,  the  Gauss-Seidel  relaxation  (see  Sec.  3). 

At  the  first  few  iterations  such  procedures  usually  seem  to  have  fast 
convergence,  with  residuals  (or  corrections)  rapidly  decreasing  from  one 
iteration  to  the  next,  but  soon  after  the  convergence  rate  levels  off 
and  becomes  very  slow.  Closer  examination  (see  Sec.  3 below)  shows  that 
the  convergence  is  fast  as  long  as  the  residuals  have  strong  fluctuations 
on  the  scale  of  the  grid.  As  soon  as  the  residuals  are  smoothed  out, 
convergence  slows  down. 

This  is  then  exactly  the  point  where  relaxation  sweeps  should  be 
discontinued  and  approximate  solution  of  the  (smoothed  out)  residual 
equations  by  coarser  grids  should  be  employed. 

The  Multi-Grid  (MG)  methods  are  systematic  methods  of  mixing  relax- 
ation sweeps  with  approximate  solution  of  residual  equations  on  coarser 
grids.  The  residual  equations  are  in  turn  also  solved  by  combining 
relaxation  sweegs  with  corrections  through  still  coarser  grids,  etc.  The 
coarsest  grid  G is  coarse  enough  to  make  the  solution  of  its  algebraic 
system  inexpensive  compared  with,  say,  one  relaxation  sweep  over  the 
finest  grid. 
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The  following  sections  further  explain  these  ideas.  Sec.  3.1  explains, 
through  a simple  example,  what  is  a relaxation  sweep  and  shows  that  it  indeed 
smooths  out  the  residuals  very  efficiently.  The  smoothing  rates  of  general 
difference  systems  are  summarized  in  Sec.  3.2.  A full  multi-grid  algorithm, 
composed  of  relaxation  sweeps  over  the  various  grids  with  suitable  inter- 
polations in  between,  is  then  presented  in  Sec.  4.  An  important  modification 
for  nonlinear  problems  is  described  in  Sec.  5 (and  used  later  as  the  basic 
algorithm  for  non-uniform  grids  and  adaptive  procedures) . Appendix  A supple- 
ments these  with  suitable  stopping  criteria,  details  of  the  interpolation 
procedures  and  special  techniques  (partial  relaxation) . 


3.  RELAXATION  AND  ITS  SMOOTHING  RATE. 


3.1  An  Example . Suppose,  for  example,  we  are  interested  in  solving 
the  partial  differential  equation. 

(3.1)  LU(x,y)  = a a2pt».jy>.  ,b  aW>  - F(x,y) 


3x 


3y 


with  some  suitable  boundary  conditions. ^ Denoting  by  U*  and  F"  approximations 
of  U and  F,  respectively,  on  the  grid  G , the  usual  second-order  discretization 
of  (3.1)  is 


U * _ 2U  k + u k 

(3.2)  lV  . -=  , , b 

o » p , * 


u k - 2U  k + u k 
o,B+l  a,  3 a, 6-1 


= F 


a, 3 


where 


Fk  a = Fk(ah.,Bh.);  o,3  integers. 

Ot  f p K K 


(In  the  multi-grid  context  it  is  important  to  define  the  difference  equations 
in  this  divided  form,  without,  for  example,  multiplying  throughout  by  h^  , 
in  order  to  get  the  proper  relative  scale  at  the  different  levels.)  Given 
an  approximation  u to  U , a simple  example  of  a relaxation  scheme  to  improve 
it  is  the  following. 

Gauss-Seidel  Relaxation:  The  points  (o,3)  of  G are  scanned  one  by  one 

in  some  prescribed  order;  e.g.,  lexicographic  order.  At  each  point  the  value 

u „ is  replaced  by  a new  value,  u a,  such  that  equation  (3.2)  at  that  point 

o , 3 o,B 

is  satisfied.  That  is,  u „ satisfies 

o,B 


(3.3) 


ug+l, 3 2uq,3  + U o-l,3 


+ b 


ug,B+l  ~ 2uq. 3+  U a, 3-1 


= F 


\ 


a, 3' 


where  the  new  values  u R u „ are  used  since,  in  the  lexicographic 
order,  by  the  time  {o,§y1li 'scaAnea  new  values  have  already  replaced  old 
values  at  (o-l,B)  and  (o,3-l) . 

A complete  pass,  scanning  in  this  jnanner  all  the  points  of  G , is 
called  a (Gauss-Seidel  lexicographic)  G relaxation  sweep.  The  new  ap- 
proximation u does  not  satisfy  (3.2),  and  further  relaxation  sweeps  may 
be  required  to  improve  it.  An  important  quantity  therefore  is  the 
rate  of  convergence,  y say,  which  may  be  defined  by 
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(3.4)  y = 


P 


|v| 


V ■>  V » 

, where  v = U -u,  v «=  u - u, 


||*||  being  any  suitable  discrete  norm. 

The  rate  of  convergence  of  the  above  relaxation  scheme  is 
asymptotically  very  slow.  Jhat  is,  except  for  the  first  few  relaxation  2 
sweeps  we  have  u = 1 - 0(hk  ).  This  means  that  we  have  to  perform  Oth^  ) 
relaxation  sweeps  to  reduce  the  error  order  of  magnitude. 

In  the  multi-grid  method,  however,  the  role  of  relaxation  is  not  to 
reduce  the  error,  but  to  smooth  it  out;  i.e.,  to  reduce  the  high-frequency 
components  of  the  error  (the  lower  frequencies  being  reduced  by  relax- 
ation sweeps  on  coarser  grids) . In  fact,  since  smoothing  is  basically 
a local  process  (high  frequencies  have  short  coupling  range) , we  can 
analyze  it  in  the  interior  of  G by  (locally)  expanding  the  error  in 
Fourier  series.  This  will  allow  us  to  study  separately  the  convergence 
rate  of  each  Fourier  component,  and,  in  particular,  the  convergence 
rate  of  high-frequency  components,  which  is  the  rate  of  smoothing. 

Thus  to  study  the  0=(0^, Fourier  component  of  the  error  functions 
v and  v before  and  after  the  relaxation  sweep,  we  put 

i (0  01+0.6)  i (0  a+0  6) 

(3'5)  vo,S  = A0  6 and  = *0  e 


Subtracting  (3.2)  from  (3.3),  we  get  the  relation 

(3-6)  a (vu+i,0  - 2Ve  + Vi,3>  + c (vu,6+i  - 2Ve  + Ve-i> 

from  which,  by  (3.5), 

i9l  i02  "i0l  "i02 

(ae  + ce  ) A0  + (ae  +c  e - 2a  - 2c)  Ag  = 0. 

Hence  the  convergence  rate  of  the  0 component  is 


= 0, 


(3.7)  y(0)  = |-£-|  = 

o 


i0l 

i02 

ae  + 

ce 

-i0.  -i0_ 

1 2 

2a+2c  - ae 

- ce 

Define  | 0 | = max  (|01|,|02l>.  In  domains  of  diameter  0(1)  the  lowest 

Fourier  components  t^ave  | 0 | - O(h^) , and  their  convergence  rate  therefore 
is,  y ( 0 ) = 1 - 0(h^  ).  Here,  however,  we  are  interested  in  the  rate  of 
smoothing,  which  is  defined  by 


(3.8) 


u = 


max 


y(0) , 


PIT  < 0 < TT 
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where  p is  the  mesh-size  ratio  and  the  range  pir  < 1 6 1 tt  is  the  suitable 
range  of  high-frequency  components,  i.e.,  the  range  of  components  that 
cannot  be  approximated  on  the  coarser  grid,  because  its  mesh-size  is 

A ....  . A 1 


h^_^  = h^  / p.  We  will  assume  here  that  p = -- 
(cf.  Sec.  6.2). 


which  is  the  usual  ratio 


Consider  first  the  case  a 
culation  shows  that  y = y ( — , 
factory  rate;  it  inplies  that  3 


c (Poisson  equation) . A simple  cal- 
arccos  — ) = .5.  This  is  a very  satis- 
r e 1 ax ation  sweeps  reduce  the  high- 


frequency  error- components  by  almost  an  order  of  magnitude.  Similar 
rates  are  obtained  for  general  a and  c,  provided  a/c  is  of  moderate 


size. 


The  rate  of  smoothing  is  less  remarkable  in  the  degenerate  case 
a<<c  (or  c<<a) . For  instance, 


(3.9) 


y(  j , 0 ) = 


2 2 
a + c 


1 1/2 


2 2 
a + (c+2a) 


which  approaches  1 as  a -*■  o.  Thus,  for  problems  with  such  a degeneracy, 
Gauss-Seidel  relaxation  is  not  a suitable  smoothing  scheme.  But  other 
schemes  exist.  For  example. 

Line  Relaxation;  Instead  of  treating  each  point  (a, 3)  of  G separ- 
ately, one  take* simultaneously  a lige  of  points  at  a time,  where  a line 
is  the  set  of  all  points  (a, 3)  in  G with  the  same  a (a  vertical  line). 

All  the  values  u „ on  such  a line  are  simultaneously  replaced  by  new 
values  u which  simultaneously  satisfy  all  the  equations  (3.2)  on  that 
line.  P (This  is  easy  and  inexpensive  to  do,  since  the  system  of 
equations  to  be  solved  for  each  such  line  is  a tridiagonal,  diagonally 
dominant  system.  See,  e.g. , in  [17].)  As  a result,  we  get  the  same  relation 

as  (3.3)  above,  except  that  u is  replaced  by  u a±..  Hence,  instead 

. , . ...  , ot , l o;,3'i 

of  (3.7)  we  will  get: 


(3.10) 


y (0) 


-ie. 


2 (a+c  - c cosG^) 


ae 


from  which  one  can  derive  the  smoothing  rate 


r» 


(3.11) 


y = max 


-1/2 


} • 


a+2c 

which  is  very  satisfactory,  even  in  the  degenerate  case  a<<c. 


<•> 

& 


X 1 

f 

i* 


3.2.  General  results.  The  above  situation  is  very  general  (see  [4] 
and  Chapter  3 of  [3]):  For  any  uniformly  elliptic  system  of  difference 

equations,  it  can  be  shown  that  few  relaxation  sweeps  are  enough  to 
reduce  the  high-frequency  error  components  by  an  order  of  magnitude. 
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The  same  holds  for  degenerate  elliptic  systems,  provided  a suitable  relax- 
ation scheme  is  selected.  A scheme  of  line-relaxation  which  alternately 
use  all  line  directions  and  all  sweeping  directions  is  suitable  for 
any  degenerate  case.  Moreover,  such  a scheme  is  suitable  even  for  non- 
elliptic  systems,  provided  it  is  used  ''selectively" ; i.e.,  the  entire 
domain  is  swept  in  all  directions,  but  new  values  are  not  introduced 
at  points  where  a local  test  shows  the  equation  to  be  non-elliptic  and 
the  forward  characteristic  direction  to  conflict  with  the  current 
sweeping  direction. 

By  employing  local  mode  analysis  (analysis  of  Fourier  components) 
similar  to  the  example  above,  one  can  explicitly  calculate  the  smoothing 
rate  p for  any  given  difference  equation  with  any  given  relaxation  scheme. 
(Usually  p should  be  calculated  numerically;  an  efficient  FORTRAN  sub- 
routine exists;  typical  values  are  given  in  Table  1,  in  Sec.  6.2).  In 
this  way,  one  can  select  the  best  relaxation  scheme  from  a given  set  of 
possibilities.  The  selection  of  the  difference  equation  itself  may  also 
take  this  aspect  into  account.  This  analysis  can  also  be  done  for 
non-linear  problems  (or  linear  problems  with  non-constant  coefficients) , 
by  local  linearization  and  coefficients  freeze.  Such  localization  is 
fully  justified  here,  since  we  are  interested  only  in  a local  pro- 
perty (the  property  of  smoothing.  By  contrast,  one  cannot  make  similar 
mode  analysis  to  predict  the  overall  convergence  rate  p of  a given  relax- 
ation scheme,  since  this  is  not  a local  property). 

An  important  feature  of  the  smoothing  rate  p is  its  insensitivity. 

In  the  above  example  no  relaxation  parameters  were  assumed.  We  could 

introduce  the  usual  relaxation  parameter  w;  i.e.,  replace  at  each  point 

the  old  value  u . not  with  the  calculated  u „ , but  with 
a, 8 u,6' 

u , + w(u  -u  .)  . The  mode  analysis  shows,  however,  that  no  iii  / 1 
o,6  o,6  o,6 

provides  a smoothing  rate  better  than  co=l.  In  other  cases,  w=l  is  not 
optimal,  but  its  p is  not  significantly  larger  than  the  minimal  p.  In 
delayed-displacement  relaxation  schemes  a value  w < “crjLtical  < ^ should 

often  be  used  to  obtain  p < 1,  but  there  is  no  sensitive  dependence  on  the 
precise  value  of  u>,  and  suitable  values  are  easily  obtained  from  the  local 
mode  analysis.  Generally,  the  smoothing  rate  of  delayed-displacement 
schemes  is  somewhat  worse  than  that  of  immediate-displacement  schemes, 
and  the  latter  should,  therefore,  be  preferred,  except  when  parallel 
processing  is  used. 


3.3.  Acceleration  by  weighting.  The  rate  of  smoothing  p may  some- 
times be  further  improved  by  various  parameters  introduced  into  the 
scheme.  Since  p is  reliably  obtained  from  the  local  mode  analysis,  we 
can  optimize  these  parameters  to  minimize  p.  For  linear  problems, 
such  optimal  parameters  can  be  determined  once  and  for  all,  since  they 
do  not  depend  on  the  shape  of  the  domain.  For  nonlinear  problems  precise 
optimization  is  expensive,  and  one  should  prefer  the  simpler,  more  ro- 
bust relaxation  schemes,  such  as  SOR. 


One  general  way  of  parametrization  is  the  weighting  of  corrections. 
We  first  calculate,  in  any  relaxation  scheme,  the  required  correction 


6u  = 
v 


- u 


(where  v = (a, 6)  or,  for  a general  dimension  d. 
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v = ^V1»V2'  vj  integers).  Then,  instead  of  introducing  these 

corrections,  we  actually  introduce  corrections  which  are  some  linear 
combination  of  du^  at  neighboring  points.  That  is,  the  actual  new 

values  are 


(3.12) 


. 

u = u + £„  co  6u  , 

v v yeF  y v+y 


where  the  weights  co 


are  the  parameters,  y = (y^y^ 


• ,Yd) ' 


Yi 


in- 


tegers and  T a small  set  near  (o,o,  ...,o).  For  any  fixed  T we  can 
optimize  the  weights.  In  case  T = {o},  gj  is  the  familiar  relaxation 
parameter.  Weighting  larger  T is  useful  in  delayed  displacement  relax- 
ation schemes.  For  immediate-displacement  line  relaxation,  weighting 
along  the  line  may  be  useful. 


Examples . In  case  of  simultaneous  displacement  (Jacobi)  relaxation 

for  the  5-points  Poisson  equation,  the  optimal  weights  for  F={o}  is 

w = .8,  for  which  the  smoothing  rate  is  y = .60.  For  the  set 
oo 

r = { (y  ,y0) : |y. | + l y I £ 1 } the  optimal  weights  are  u>  = 6co  = 

1 c -L  Z OO  O f t1 

6<o  = 48/41,  yielding  y = 9/41.  This  rate  seems  very  attractive; 

• X f u 

the  smoothing  obtained^j^one^swe^^e^uals  that  obtained  by 

(log  ) / (log  j)  = 2.2  sweeps  of  Gauss-Seidel  relaxation.  Actually, 

however,  each  sweep  of  this  weighted-Jacobi  relaxation  requires  9 additions 
and  3 multiplications  per  grid  point,  whereas  each  Gauss-Seidel  sweep 
requires  only  4 additions  and  1 multiplication  per  point,  so  that  the 
two  methods  have  almost  the  same  convergence  rate  per  operation,  Gauss- 
Seidel  being  slightly  faster.  The  weighted  Jacobi  scheme  is  considerably 
more  efficient  than  any  other  simultaneous-displacement  scheme,  but  like 
any  carefully  weighted  scheme,  it  is  considerably  more  sensitive  to 
various  changes. 


The  acceleration  by  weighting  can  be  more  significant  for  higher- 
order  equations.  For  the  13-points  biharmonic  operator,  Gauss-Seidel 
relaxation  requires  12  additions  and  3 multiplications  per  grid  point 
and  gives  y = .802,  while  weighted  Jacobi  (with  weights  ioqo  = 1.552, 

u , = co  , = .353)  requires  17  additions  and  5 multiplications  per 

o , +1  +1,0 

point  and  gives  y = .549,  which  is  2.7  tiiges  faster.  (The  best  relax- 
ation sweep  for  the  biharmonic  equation  A U = F is  to  write  it  as  the 
system  AV=F,  AU=V  and  sweep  Gauss-Seidel,  alternatively  on  U and  V. 
Such  a double  sweep  costs  8 additions  and  2 multiplications  per  grid 
point,  and  yields  y=.5.  But  a similar  procedure  is  not  possible  for 
general  4-th  order  equations.) 


519 


4.  A MULTI-GRID  ALGORITHM  (CYCI.E  P)  FOR  LINEAR  PROBLEMS 

There  are  several,  actual  algorithms  for  carrying  out  the  basic  multi- 
grid idea,  each  with  several  possible  variations.  We  present  here  an  algor- 
ithm (called  "Cycle  C"  in  [3])  which  is  easy  to  program,  generally  ap- 
plicable and  never  significantly  less  efficient  than  the  others  ("Cycle  A" 
and  "Cycle  B" ).  The  operation  of  the  algorithm  for  linear  problems  is 
easier  to  learn,  and  is  therefore  described  first.  In  the  next  section 
the  FAS  (Full-Approximation-Storage)  mode  of  operation,  suitable  for  non- 
linear problems  and  other  important  generalizations,  will  be  described. 

A flow-chart  of  the  algorithm  is  given  in  Figure  1.  (For  completion,  we 
also  flowchart,  in  Fig.  2,  Cycles  A and  B.)  A sample  FORTRAN  program  of 
this  cycle,  together  with  a computer  output,  is  given  in  Appendix  B. 

M 

Cjjjjcle  C starts  with  some  approximation  u being  given  on  the  finest 
grid  G . In  the  linear  case  one  can  start  wi?£|  any  approximation,  but 
a major  part  of  the  computations  is  saved  if  u has  smooth  residuals 

M ° m M M 

(e.g.,  if  u satisfies  the  boundary  conditions  and  L u -F  is  smooth. 

As  explained  in  Sec.  6,  smoothing  the  residuals  involves  most  of  the 
computational  effort).  In  the  nonlinear  case,  one  may  have  to  use  a 
continuation  procedure,  usually  performed  on  coarser  grids 
(cf.  Sec.  8.2).  Even  forMlinear  problems,  the  most  efficient 
algorithm  is  to  gbjain  uq  by  interpolating  from  an  approxi- 
mate solution  u calculated  on  G by  a similar  al- 
gorithm. (Hence  the  denomination  "cycle"  for  our  present 
algorithm,  which  would  generally  serve  as  the  basic  step  in 
processes  of  continuation,  refinement  and  grid  adaptation, 
or  as  a time  step  in  evolution  probgeijs) . Fog  highest  ef- 
ficiency, the  interpolation  from  u to  u should  bg  jf 
sufficiently  high  order,  to  exploit  all  smoothness  in  u 
(Cf.  (A. 7)  in  Sec.  A. 2,  and  see  also  Sec.  6.3.) 

The  basic  rule  in  Cycle  C is  that  each  v (the  function  defined  on 
the  grid  G ; k=0,  . . , M-l)  is  designed  to  serve  as  a correction^^r 

the  approximation  v previously  obtained  on  the  next  finer  grid  G , 
if  and  when  that  approximation  actually  requjL£<p  a coarse-grid  cor- 
rection, i.e.,  if  and  when  relaxation  over  G exhibits  slow  rate  of 
convergence.  Thus,  the  equations  to  be  (approximately)  satisfied  by 
v are 

(4.1)  lV  - f\  AV  - 

k k 

where  f and  <J>  are  the  residuals  (to  the  ig^|rior  equation  and  the 
boundary  condition,  respectively)  left  by  v , that  is. 


k _ k ...k+1  rk+l  k+1 
£ * IJc+l  <£  -tv). 


_ k ,,k+l  » k+1  k+1. 

!k+i  <*  - A v ) 


1 k 

We  denote  by  V the  functions  iji  the  equations , to  distinguish  from 
their  competed  approximations  v . When  v is  changing  in  the  algorithm 
(causing  V to  change) , V remains  fixed. 
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We  use  the  potation  1^  to  represent  int<‘i polai  ion  from  G to  G . In 
case  m>k,  1^  may  represent  a simple  transfer  of  values  to^the  coarser 
grid  G from  the  corresponding  points  in  the  finer  grid  G ; or  instead, 
it  may  represent  jrransfer  of  some  weighted  averages.  In  case  k>m,  as  in 
step  (e)  below,  I is  usually  a polynommal  interpolation  of  a suitable 
order  (at  least  ti?e  order  of  the  differential  equation.  See  Secs.  A. 2 
and  A. 4 for  more  details). 

The  equa^^ns  on  G are  thus  defined  in  terms  of  the  approximate 
solution  on  G .On  the  finest  grid  G , the  equations  are  the  original 
ones;  namely 


(4.3)  fM=FM, 


,M  aM 

p = $ , 


M M 
V = u . 


The  steps  of  the  algorithm  are  the  following: 

(a)  Set  k-*-M  (k  is  the  working  level,  and  we  start  a^  the^inest 
level) , and  introduce  the  given  approximation  v uq  . 

(b)  Improve  v by  one  relaxation  sweep  for  the  difference  equations 
(4.1).  Symbolically,  we  write  such  a sweep  as 


Relax 


ik  y, 
9 v 


(c)  If  relaxation  has  sufficiently  converged  (the  precise  cri- 
terion is  described  in  Secs.  A. 7 and  A. 8) , go  to  Step  (f ) . 

If  not,  and  if  the  convergence  rate  is  still  fast  (by  a cri- 
terion given  in  Sec.  A. 6)  go  back  to  Step  (b) . If  con- 
vergence is  not  obtained  and  the  rate  is  slow,  go  to  Step  (d) . 

(d)  If  k=o  (the  slow  convergence  has  taken  place  at  the  coarsest 
grid  G°) , go  back  to  Step  (b)  (to  continue  relaxation  never- 
theless, since  on  G°  relaxation  is  very  inexpensive.  If,  how- 
ever, the  problem  is  indefinite,  then  slow  rate  of  divergence 

may  occur,  in  which  case  the  G°  problem  should  be  solved  directly. 
This  is  as  inexpensive  as  relaxation,  but  requires  additional 
programming.  See  Sec.  4.1  below).  If  k>o,  lower  k by  1 (j:o 
compute  correction  on  the  next,  coarser  ^Level) . Compute  f and 
4>  on  this  new  level,  using  (4.2),  put  v =o  as  the  starting  ap- 
proximation, and  go  to  Step  (b) . 

(e)  If  k=M  (convergence  has  been  obtained  oji  the  finest  level) , 
the  algorithm  is  terminated.  If  k<J1+|v  has  converged  and  is 
ready  to  serve  as  a correction  to  vK  ) , put 

k+1  k+1  , Tk+1  k 

k 


Then  advance  k by  1 (to  resume  computations  in  the  finer  level) 
and  go  to  Step  (b) . 


The  storage  required  for  this  algorithm  is  only^a  fraction  more  than 
the  number  of  locations,  2n  say,  required  to  store  u and  F on  the  finest 
grid.  Indeed,  for  a d-dimensional ^problejjj, ^ a storage  of  roughly  2n/2 
legations  is  required  to  store  v and  f , the  next  level  requires  2n/ 

2 , etc.  The  total  for  all  levels  is 


523 


(4.6) 


j 2d 

(4.6)  2n(l+2  +2_  +...  ) < 2n  . 

2d  -1 

(In  the  FAS  version  below,  a major  reduction  of  storage  area  is  possible 
through  segmental  refinement.  See  Sec.  7.5  ) 

4.1.  Indefinite  Problems  and  the  Size  of  the  Coarsest  Grid.  If,  on 
— ■ - ■ — - — - ■ ■ 

any  grid  G , the  boundary  value-problem  (4.1)  is  a non-definite  elliptic 
problem,  with  eigenvalues 

(4.7)  A*  i *2  £ ...  < ° < i ■ 

and  with  the  corresponding  eigenfunctions  V^,  V^,...,V^,  V^^,..., 

then  it  cannot  be  solved  by  straight  relaxation.  Any  relaxation  gweep 
will  reduce  the  error  components  in  the  space  spanned  by  V^+^,  V£+2  ' 

but  will  magnify  all  components  in  the  span  of  V^,  V^,  . ..,  v£.  A multi- 
grid solution,  however,  is  not  seriously  affected  by  this  magnification, 
provided  the  magnified  components  are  suitably  reduced  by  the  coarse- 
grid  corrections.  This  will  usually  be  the  case,  since  these  components 
are  basically  of  low  frequency  and  are  well  approximated  on  coarser  grids. 
But  care  should  be  taken  regarding  the  coarsest  grid: 

On  the  coarsest  grid,  an  indefinite  problem  should  be  solved 
directly,  (i.e.,  not  by  relaxation  of  any  kind.  Semi-iterative  solutions, 
like  Newton  iterations  for  non-linear  problems,  are,  of  course,  per- 
missible). Furthermore,  this  grid  should  be  fine  enough  to  provide  rough 


approximation  to  A ^ for  any  k,  hence  also  for  the  corresponding 

differential  eigenfunctions.  This  means  that  G°  should  contain  at 
least  0(1.)  , probably  21,  points.  Or,  in  other  words,  G°  should  be  jjjst 
fine  enough  to  still  have  smoothing  capability  at  any  finer  level  GK. 

For  example,  if  SOR  relaxations  with  u<(d  are  used,  h should  satisfy 
(see  [4]  or  Sec.  3 in  [3]) 

(4.8)  Re  {B (0  ,h)  / b (h) } > o,  (o  < h < h ) , 

o — — o 

where  B(9,h)  is  the  symbol  of  i/1  (see  (A. 3)  in  Appendix  A)  and  b (h)  is  its 
central  coefficient. 

Usually,  G°  can  still  be  coarse  enough  to  have  the  direct  solution 
of  its  equations  still  far  less  expensive  than,  say,  one  relaxation 
sweep  over  the  finest  grid,  so  that  the  indefinite  problem  is  solved  with 
the  same  overall  efficiency  as  definite  problems. 


5.  THE  FAS  (FULL  APPROX I MAT I ON -STORAGE)  ALGORITHM. 

In  the  JTAS  mode  of  the  multi-grid  algorithms,  instead  of  storjina  a 
correction  v (designed  to  correct  the  finer-level  approximation  u ) , 
the  idea  is  to  storj;  the  full  current  approximation  u,  which  is  the  sum 
of  the  correction  v and  its  base  approximation  u*  : 
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(5.1) 


k _ k k+1  . k ,,  , , . 

u » I ,u  + v > (k“o,l , . . . ,M-1) . 


In  terms  of  these  full-approximation  functions,  we  can  rewrite  the  cor- 
rection equations  (4.1-3)  as 


(5.2) 

where 


lV  = Fk, 


AkUk  - Ik 


* , 


-k  k,  k k+1,  k ,-k+l  Tk+1  k+1, 

F - L (Ik+1  u ) + IR+1  (F  - L u ) , 


,c  ,,  rk  .k._  k k+1,  , _ k ,Tk+l  .k+1  k+1, 

(5.3)  4 « A <1  u ) + I x (4  - A u ) , 


(k-0,1,.. . ,M-1) , 


and  where  for  k“M  we  have  the  original  problem,  i.e., 


(5.4) 


-M  M 
F ■ F , 


4 


M 


aM 
4 . 


For  linear  problems,  equations  (5.2-4)  are  exactly  equivalent  to 
(4.1-3).  The  advantage  of  the  FAS  mode  is  that  equations  (5.2-4)  apply 
equally  well  to  nonlinear  problems.  To  see  this,  consider  for  instance 

the  nonlinear  equation  i/V*  - FM  given  on  the  finest  grid.  Given  an 
approximate  solution  u we  can  still  improve  it  by  relaxation  sweeps, 
with  smoothing  rates  ” (varying  over  the  domain,  but  still  reliably 
estimated  by  mode  analyses,  applied  locally  to  the  linearized-freexed  equation) 
As  in  the  linear  case,  the  smoothed -out  functions  are  the  residual 

f*  - fm  - LMUM 

and  the  correction  UM-uM.  Therefore,  the  equation  that  can  be  approx- 
imated on  coarser  grids  is  the  residual  equation 


lV  - -M  M 


L u 


f"  . 


Its  coarser-grid  approximation  is 

jM-iyM-! 


(5.5) 


tM-1  tM-1  M _M-1  -M 
L 1 M U " 1 M f ' 


which  is  the  sajjj^as  (5.2)  fog  k-M-1.  In  interpolating  U*1-1  (or  a computed 
approximation  u ) back  to  G , we  should  actually  interpolate 
X m-  X M 

v - I " u , because  this  is  the  coarse-grid  approximation  to  the 

M 


Again  we  distinguish  between  the^notatior.  used  to  write  the  equations 
and  the  computed  approximation  u . Equation  (5.2),  for  k<M,  is  not  equival- 
ent to  (2.2),  although  they  both  use  the  notation  l). 
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smoothed-ojat  function  l/^-u  . Similarly, interpolating  an  (approximate) 
solution  U of  (5.2)  to  the  finer  grid  GK  , the  polynominal  inter- 
polation should  operate  on  the  correction.  Thus  the  interpolation  is 

/c  c.\  k+1  ^ k+1  . rk+1  / k T k k+1, 

(5.6)  u + u + I k (u  - Ik+1  u ) , 

which  is  equivalent  to  (4.5).  Note  that  generally, 

Tk+1  T k k+1  , k+1 

1 k Xk+1  U * U • 

The  FAS  (Cycle  C)  algorithm  is  the  same  algorithm  as  in  Sec.  4,  with 
the  FAS  equations  (5.2-4)  replacing  (4.1-3),  and  with  (5.6)  replacing  (4.5). 
It  is  flowcharted  in  Fig.  3. 

The  FAS  mode  has  several  important  advantages:  It  is  suitable  for 

general  nonlinear  problems,  with  the  same  procedures  (relaxation  and  inter- 
polation routines)  used  at  all  levels.  Thus,  for  example,  only  one 
relaxation  routine  should  be  written.  Moreover,  this  mode  is  suitable  for 
composite  grids  (non-uniform  grids  created  by  increasingly  finer  levels 
being  defined  on  increasingly  smaller  subdomains;  see  Sec.  7.2),  which  is 
the  basis  for  grid  adaptation  on  one  hand,  and  segmental  refinement 
(see  Sec.  7.5)  on  the  other  hand.  Generally  speaking,  the  £asic  feature  of 
the  FAS  mode  is  that  the  function  stgre^  on  a coarse  grid  G coincides  there 
with  the  fine-grid  solution:  u = u • This  enables  us  to  manipulate 

accurate  solutions  on  coarse  grids. 

The  storage  required  for  the  FAS  algorithm  is  again  given  by  (4.6). 

With  segmental  refinement  (Sec.  7.5)  it  can  be  reduced  far  below  that, 
even  to  O(log  n) . 

An  important  by-product  of  the  FAS  mode  is  a good  estimate  for  the 
truncation  error,  which  is  useful  in  defining  natural  stopping  criteria 
(see  Sec.  A. 8)  and  grid-adaptation  criteria  (Sec.  8.3).  Indeed,  for 
any  k<m<M  it  can  easily  be  shown  (by  induction  on  m,  using  (5.2-3))  that 

-k  Tk  -m  ,k,_k  m.  k m m 

F - I F = L (I  u ) — I L u , 

m mm 

(5. 7) 

■rk  k r-m  ,k/Tk  m.  „k  „m  m 

$ - I $ = A (I  u ) — I A u , 

m mm 

in 

which  are  exactly  the  G approximations  to  the  G truncation  errors. 

A slight  disadvantage  of  the  FAS  mode  is  the  longer  calculation  required 
in  computing  F , which  is  almost  twice  as  long  as  the  calculation  of  f in 
the  former  (Correction-Storage)  mode.  This  extra  calculation  is  equivalent 
to  one  extra  relaxation  sweep  on  G , but  only  for  k<M,  and  is  about  5%  to 
10%  of  the  total  amount  of  calculations.  Hence,  for  linear  problems  on 
uniform  grids,  the  CS  mode  is  slightly  preferable. 


ft.  PERFORMANCE  ESTIMATES  AND  NUMERICAL  TESTS 

O.l.  Predictability.  An  important  feature  of  the  multi-grid  method 
in  that,  although  itera  tive,  its  total  computational  work  can  be  pre- 
dicted in  advance  by  local  mode  (Fourier)  analysis.  Such  an  analysis, 
which  linearizes  and  freezes  the  equations  and  ignores  distant  boundaries, 
gives  a very  good  approximation  to  the  behavior  of  high-frequency  com- 
ponents (since  they  have  short  coupling  range) , but  usually  fails  to  approx- 
imate the  behavior  of  the  lowest  frequencies  (which  interact  at  long 
distances}.  The  main  point  here,  however,  is  that  these  lowest  frequencies 
may  indeed  be  ignored  in  the  multi-grid  work  estimates,  since  their  con- 
vergence is  obtained  on  coarser  grids,  where  the  computational  work  is 
negligible.  The  purpose  of  the  work  on  the  finer  grids  is  only  to  converge 
the  high  frequencies.  Thus,  the  mode-analysis  predictions,  although  not 
rigorous,  are  likely  to  be  very  realistic.  In  fact,  these  predictions 
are  in  full  agreement  with  the  results  of  our  computational  tests.  (For 
rigorous  bounds  - see  App.  C) . 

6.2.  Multi-Grid  Rates  of  Convergence.  To  get  a convenient  measure 
of  convergence  per  unit  work,  we  define  as  our  Work  Unit  (WU)  ^he  com- 
putational work  in  one  relaxation  sweep  over  the  finest  grid  G . The 
number  of  computer  operation^  in  such  a unit  is  roughly  w»v  , where  *t 

is  the  number  of  points  in  G and  w is  the  number  of  operations  required 
to  compute  the  residual  at  each  point.  (In  parallel  processing  the  count 
should,  of  course,  be  different.  Also,  the  work  unit  should  be 
further  specified  when  comparing  different  discretization  and  relaxation 
schemes.)  If  the  mesh-size  ratio  is  p = h^^/h^  and  t^ie  problem's  domain 

is  d-dimensional,  then  a relaxation  sweep  over  Gm  ^ costs  approximately 
WUs  (assuming  the  grids  are  co-extensive,  unlike  those  in  Sec.  7) . 

Relaxation  sweeps  make  up  most  of  the  multi-grid  computational  work. 

The  only  other  p^ojess  thaj:  consumes  any  significant  amount  of  compu- 
tations is  the  I and  I interpolations.  It  is  difficult  to  measure 
them  precisely  in  WUs,  but  their  total  work  is  always  considerably  smaller 
than  the  total  relaxation  work.  In  the  example  in  Appendix  B,  the  inter- 
polation work  is  about  20%  of  the  relaxation  work.  Usually  the  percentage 
is  even  lower,  since  relaxing  Poisson  problems  is  particularly  inexpensive. 
To  unify  our  estimates  and  measurements  we  will  therefore  define  the  multi- 
grid  convergence  rate  p as  the  factor  by  which  the  errors  are  reduced  per 


one  WU  of  relaxation,  ignoring  any  other  computational  work  (which  is 
never  more  than  30%  of  the  total  work) . 

The  multi-grid  rate  of  convergence  may  be  estimated  by  a full  local 
mode  analysis.  The  following  is  a simplified  analysis,  which  gives  a 
good  approximation.  We  assume  that  the  relaxation  sweep  over  any  grid 


G affects  error  components  e 


where 


only  in  the  range 


0.  x.  , 

3 3 


l<j  <d 


1 


(The  9/h^  of  Sec.  3 and  Appendix  A is  denoted  here  0,  to  unify  the  dis- 
cussion of  all  levels.)  In  fact,  if  proper  interpolation  scheme  is  used 
(see  Sec.  A. 21  only  components  in  the  range  1 0 1 <_  (l+e)ir/  h^  ^ , say  , 
are  affected  by  interactions  with  coarser  grids.  But  if  proper  residual- 
weighting is  also  used  (to  make  a « 1;  cf.  Sec.  A. 4)  £hen  the  combined 
action  of  the  coarse-grid  correction  cycles  and  the  G relaxation  sweeps 
yields  convergence  rates  which  are  slowest  at  | Q | = ir/h^  ^ (cf.  App.  A). 
For  such  0 the  coarse-grid  cycles  have  neutral  effect,  sjince  o =1,  hence 
the  convergence  rate  is  indeed  as  affected  only  by  the  G relaxation 
sweeps . 

One  relaxation  sweep  over  G reduces  the  error  components  in  the 

range  ■■  - < 1 0 1 < by  the  smoothing  factor  p.  (See  Sec.  3.  If  the 

k-1  “ _hk 

smoothing  rate  near  a boundary  is  slower  than  p , which  is  not  the  usual 
case,  smoothing  may  be  accelerated  there  by  partial  relaxation  sweeps  - 
cf.  Sec.  A. 9.)  Thus  a multi-grid  cycle  with  s relaxation  sweeps  on 
each  level  reduces  all  error  components  by  the  factor  p . The  amount  of 
work  units  expended  in  these  sweeps  is 


Ad  A2d  , 
s + sp  + sp  + 


+ s£(M'1)d  < 


, Ad 
1-P 


Hence, 

(6.2) 


the  multi-grid  convergence  rate  is 


8 . 


which  is  not  much  bigger  than  p.  In  case  5>1,  the  effective  smoothing  rate 
(see  (A. 8))  should  replace  p in  this  estimate. 

Estimate  (6.2)  is  not  rigorous,  but  is  simple  to  compute  and  very 
realistic.  In  fact,  numerical  experiments  (Sec.  6.4-5)  usually  show  slightly 
faster  (smaller)  rates  p,  presumably  because  the  worst  combination  of 
Fourier  components  is  not  always  present. 

The  theoretical  multi-grid  convergence  rates , for  various  represent- 
ative cases,  are  summarized  in  Table  1. 


\ 

) ( 


> 
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Explanations  to  Table  1.  The  first  column  specifies  the  difference 
operator  and  the  dimension  d.  A denotes  the  central  second-order 
( (2d+l) -point)  approximation,  and  A'h  the2 fourth-order  ( (4d+l) -point  "star") 
approximation,  to  the  Laplace  operator.  A^  is  the  central  13-point  approxi- 
mation to  the  biharmonic  operator.  The  operators  3,3,3  and  3 are  the 
usual  central  second-order  approximations  to  the  corresponding  partYXl- 
differential  operators.  3 is  the  backward  approximation.  Upstream  dif- 
ferencing is  assumed  for  tfie  inertial  terms  of  the  Navier  stokes  equations; 
central  differencing  for  the  viscosity  terms,  forward  differencing  for  the 
pressure  terms,  and  backward  differencing  for  the  continuity  equation. 

Rh  is  the  Reynolds  number  times  the  mesh-size. 


The  second  column  specifies  the  relaxation  scheme  and  the  relaxation 
parameter  w.  SOR  is  Successive  Over  Relaxation,  which  for  u>=l  is  the 
Gauss-Seidel  relaxation.  xLSOR  (yLSOR)  is  Line  SOR,  with  lines  in  the 


k: ~ ::: — r~ — ; — — -J 


t<? 


TABLE  1.  (Cont'd.  Here  d=2,  p=l:2) 


Relax. 

Scheme  ui 

3 + e3  , e<<l 

SOR,  xLSOR  any 

xx  yy 

a3  + c3 

yLSOR  1 

xx  yy 

, . ,a  c.. 

(q  = min  (—  , — ) ) 

C 3 

ADLR 

SD,  yLSD,  ADLSD  1 

SD  (2q+2 ) / (3q+2 ) 

yLSD  (2a+2c) / (2a+3c) 
ADLSD  2/3,  2/3 

a.  - 2.  a 

h h x 


Navier  - Stokes 
with  large  Rh  in 
2 or  3 dimensions 


/c-l/2  a , 
max<5  , 


5_1/4  (l+2q)  1/2 


(q+2 ) / (3q+2 ) 
(2a+c) / (2a+3c) 
< 3~1/2  = .577 


yLSOR+ 

yLSOR- 

yLSORs 


SOR  (pressure 
corrected  by  the 
continuity  equation) , 
downstream  or  up- 
stream, with  any 
relaxation  parameters. 


max  , [5+6f|+2r|2]  ^ y 

,1  | 1+n  i < 

(3  '*5wT"1) 


< 3_1/2  = .577 


x (y)  direction.  yLSOR+,  yLSOR-  and  yLSORs  indicate,  respectively,  relax- 
ation marching  forward,  backward  and  symmetrically  (alternately  forward 
and  backward).  CSOR  means  Collective  SOR  (see  Sec.  3 in  [3])  and  the  attached 
oj ' s are  for  the  velocity  components  and  for  the  pressure.  ADLR  denotes 
Alternating  Direction  Line  Relaxation  (a  sweep  of  xLSOR  followed  by  a sweep 
of  yLSOR).  SD  is  Simultaneous  Displacement  (Jacobi)  relaxation,  WSD  is 
Weighted  Simultaneous  Displacement  with  the  optimal  weights  as  specified  in 
Sfjc.  3.3  above  (and  with  other  weights,  to  show  the  sensitivity).  WSDA  (for 
A ) is  like  WSD,  except  that  residuals  are  computed  in  less  operations  by 
making  first  a special  pass  that  computes  A u.  yLSD  is  y-lines  relaxation 
with  simultaneous  displacement,  ADLSD  is  the  corresponding  alternating-direction 
(yLSD  alternating  with  xLSD)  scheme. 

The  next  columns  list  p = h : h^+^  (see  discussion  below) , the  q 

smoothing  rate  p as  defined  by  (3.8)  .and  tljie  multi-grid  convergence  rate  p, 
calculated  by  (6.2).  We  also  list  | log  p|  , which  is  the  theoretical 
number  of  relaxation  Work  Units  required  to  reduce  the  error  by  the  factor  e, 
and  W^,  the  overall  multi-grid  computational  work  (see  Sec.  6.3).  To  make 
comparisons  of  different  schemes  possible,  we  also  list,  for  each  case,  the 
number  of  operations  ger  grid  point  per  sweep.  This  number  times  n (the 
number  of  points  on  G ) give  the  number  of  operations  in  a Work  Unit.  We 
list  only  the  basic  number  of  additions  and  multiplications  (counting  shifts 
as  multiplications) , thus  ignoring  the  operations  of  transferring  information, 
indexing,  etc.,  which  may  add  up  to  a significant  amount  of  operations,  but 
which  are  too  computer-  and  program^dependent  to  be  specified.  Also,  we 
assumed  that  the  right-hand  sides  f , including  r , are  stored  in  the  most 
efficient  form  (e.g.,  h r is  actually  stored).  Note  that  the  SOR  operation 
count  is  smaller  for  w=l  (Gauss-Seidel)  than  for  any  other  w. 


J. 


Numbers  in  this  table  were  calculated  by  Allan  S.  Goodman,  at  IBM  Thomas 
Watson  Research  Center.  A more  extensive  list  is  in  preparation. 


Mesh-size  ratio  optimization.  Examining  Tab^e  1,  and  many  other  unlisted 
examples,  it  is  evident  that  the^mesh-size  ratio  p = 1:2  is  close  to  optimal, 
yielding  almost  minimal  | log  p|  and  minimal  W^.  This  ratio  is  more  con- 
venient and  more  economic  in  the  interpolation  processes  (which  are  ignored 
in  the  above  calculations)  than  any  other  efficient  ratio.  In  practice, 
therefore,  the  ratio  p = 1:2  should  always  be  used,  giving  also  a very  desirable 
standardization . 


6.3.  Over-All  Multi-Grid  Computational  Work.  Denote  by  W^  the  com- 
putational work  (in  the  above  Work  Units)  required  to  solge  the  GM  pro- 
blem ((2.2),  k=M)  to  the  level  of  i^s^ truncation  errogs^T  (cf.  Sec.  A. 8). 
If  the  problem  is  first  solved  on  G to  the  level  T , and  if  the^cor- 
rect  order  of  interpolation  is  used  to  interpolate  the  solution  to  G (so 
that  unnecessary  high-frequencies  are  not  excited;  cf.  Sec.  g.2,  and  in 
particugaj  (A. 7)  for  i=l)  then  the  residuals  of  this  first  G approximation 
are  0(t  ) . The  computational  work  required  to  reduce  them  to  Oil  ) is 


M M- 1 
log  0(t  A ’I 


/ log  ° 


Hence , 
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(6.3) 


WM  - Vl  + 109 


T , , O 

H=T  7 109  u • 

T 


Similarly,  we  can  solve  the  G J problem  expending  work 


Vj  ■ Vh  + pAJd 


109  mT-T  7 109  ° 

T J 


(since  a GM  ^ work  unit  is  times  the  GM  unit) 


If  we  use  p-order  ap- 


proximation , then 


0(h/> 


k-1 


0<ViP) 


= 0(P  ) . 


Hence,  using  (6.4)  for  j=0 , 1 , 2 , . . .M-l  and  neglecting  W , 

o 


WM  1 


Ad  A2d  . A o 

(1+p  +p  +...)  p log  p / log  p . 


Or,  by  (6.2)  , 


Ad,  2 . 

(1  - p ) log  y 


(The  same  p was  assumed  in  computing  the  first  approximation  and  in  the 
improvement  cycles.  This  of  course  is  not  necessary.) 


Typical  values  of  this  theoretical  W are  shown  in  Table  1 above.  In 
actual  computations  a couple  of  extra  Work  Units  are  always  expended  in  solving 
a problem,  because  we  cannot  make  non-integral  number  of  relaxation  sweeps  or 
MG  cycles,  and  also  because  we  usually  solve  to  accuracy  below  the  level  of 
the  truncation  errors. 


For  5-points  Poisson  problems , for  example,  the  following  procedure  gives 


a GM  solution  with  residuals  smaller  than  tM.  (1)  Obtain  uM  1 on  GM  \ with 


^ M M M 

residuals  smaller  than  t . (ii)  Startinq  with  the  cubic  interpolation  u ■*-  I..  . u 

M“  1 


(preferably  by  using  the  difference  operator  itself;  cf.  [7]),  make  a MG  correc- 


tion cycle  such  as  Cycle  C with  n=o  (i.e.,  switching  to  G after  two  sweeps 


k k*"  1 k 

on  G ) , with  I transfer  by  injection  (cf . Sec.  A. 4)  and  I by  linear 

K , K"  1 

v 

interpolation,  and  with  "convergence"  on  G defined  as  obtained  after  the 


first  sweep  following  a return  from  G 


A precise  count  shows  Step  (ii) 


1/2  M 

to  require  30n  + 0(n  ' ) operations,  where  n is  the  number  of  points  in  G . 


Thus,  the  total  number  of  operations  is 


(1  + t + ...)30n  + 0(n1/2)  < 40n  + 0(n1/2) . 

4 lb  — 


Incidentally,  none  of  these  operations  is  a full  multiplication;  only 
additions  and  shifts  (multiplications  or  divisions  by  2 or  4)  are  used. 


* 


L*  *- 

\ 
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The  theoretical  for  this  problem  (sixth  line  in  Table  1)  amounts  to  only 
17. 5n  operations,  since  it  ignores  interpolation  work  C10.3n  operations  in 
the  above  procedure)  and  allows  non-integral  numbers  of  sweeps  and  cycles. 

In  fact,  numerical  tests  showed  the  above  algorithm  to  yield  residuals  con- 
siderably below  the  truncation  errors.  (The  only  cases  in  which  the  residuals 
approached  50%  of  the  truncation  errors  were  cases  with  high  smoothness,  in 
which  the  correct  MLAT  discretization  would  be  different;  namely,  of  higher 
order.  (Cf.  Sec.  8 and  the  remark  following  formula  (A. 7).) 


6.4.  Numerical  Experiments:  Elliptic  Problems.  A typical  numerical 

experiment  is  shown  in  Appendix  B,  including  the  FORTRAN  program  and  the 
computer  output.  The  output  shows  a multi-grid  convergence  rate 


o 

U 


.009051  ^ 
28.1  / 


1 

12.92 


= .537 


which  is  close  to,  and  slightly  faster  than,  the  theoretical  value  y = .595 
shown  in  Table  1. 

Many  numerical  experiments  with  various  elliptic  difference  equations 
in  various  domains  were  carried  out  at  the  Weizmann  Institute  in  1970-1972, 
with  the  collaboration  of  Y.  Shiftan  and  N.  Diner.  Some  representative 
results  were  reported  in  [2] , and  many  others  in  [11] . These  experiments 
were  made  with  other  variants  of  the  multi-grid  algorithm  (variants  A and 
B) , but  their  convergence  rates  agree  with  the  same  theoretical  rates  y. 

The  experiments  with  equations  of  the  form  aU  + cU  , with  a>>c,  showed 
poor  convergence  rates,  since  the  relaxation  scheme  Sed  was  Gauss-Seidel , 
and  not  the  appropriate  line  relaxation  (cf.  Sec.  3.1).  Some  of  these 
rates  were  better  than  predicted  by  the  mode  analysis,  because  the  grids 
were  not  big  enough  to  show  the  worst  behavior.  The  convergence  rates 
found  in  the  experiments  with  the  biharmonic  equation  were  also  rather 
poor  (although  nicely  bounded  away  from  1,  independently  of  the  grid  size), 
again  because  we  used  Gauss-Seidel  relaxations  and  injections  instead  of 
the  appropriate  schemes  (cf.  Sec.  3.3  and  A. 4) . All  these  points  were 
later  clarified  by  mode  analyses,  which  fully  explain  all  the  experimental 
results.  In  solving  the  stationary  Navier-Stokes  equations,  as  reported 
in  [2] , SOR  instead  of  CSOR  was  employed  (cf . table  1 above) , and  an  additional 
over-simplification  was  done  by  using,  in  each  multi-grid  cycle,  values  of  the 
nonlinear  terms  from  previous  cycle,  instead  of  using  the  FAS  scheme  (Sec.  5) . 

Nevertheless,  these  experiments  did  clearly  demonstrate  important 
features  of  the  multi-grid  method:  The  rate  of  convergence  was  essentially 

insensitive  to  several  factors,  including  the  shape  of  the  domain  0,  the 
right-hand  side  F (which  has  some  influence  only  at  the  first  couple  of 
cycles;  cf.  Sec.  A. 2)  and  the  finest  mesh-size  h (except  for  mild  varia- 
tions when  h^  is  large) . The  experiments  indicated  that  the  order  I of 

the  interpolations  I * should  be  the  order  of  the  elliptic  equation,  as 
shown  in  Sec.  A. 2 below.  (Note  that  in  [2]  the  order  was  defined  as  the 
degree  l of  the  polynominal  used  in  the  interpolation,  whereas  here 
I = 4+1.) 

More  numerical  experiments  are  now  being  conducted  at  the  Weizmann 
Institute  in  Israel  and  at  IBM  Research  Center  in  New  York,  and  will 
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be  reported  elsewhere.  We  will  hriefly  report  here  only  an  extreme  case 
of  the  multi-grid  tests  - the  solution  of  transonic  flow  problems . 


6.5.  Numerical  Experiments:  Transonic  Flow  Problems.  These  experi- 

ments were  conducted  in  1974  at  the  Weizmann  Institute  with  J.L.  Fuchs, 
and  recently  at  the  NASA  Langley  Research  Center  in  collaboration  with 
Dr.  Jerry  South  while  the  present  author  was  visiting  the  Institute 
for  Computer  Applications  in  Science  and  Engineering  (ICASE) . They  are 
preliminarily  reported  in  [12] , and  will  be  further  reported  elsewhere. 

One  purpose  of  this  work  was  to  examine  the  performance  of  the  multi-grid 
method  in  a problem  that  is  not  only  nonlinear,  but  more  significantly, 
is  also  of  mixed  (elliptic-hyperbolic)  type  and  contains  discontinuities 
(shocks) . 

We  considered  the  transonic  small-disturbance  equation  in  conservation 

form 

(6.7)  ((K-K<p  ) <p  ] + c =0, 

XXX  yy 

for  the  velocity  disturbance  potential  <j>(x,y)  outside  an  airfoil.  Here 
2 2/3  - 1 2 

K = (1-M  ) / t , K = — (y+1)  M , M is  the  free-stream  Mach  number, 

OO  2.  CO  00 

and  y=1.4  is  the  ratio  of  specific  heats,  t is  the  airfoil  thickness 
ratio,  assumed  to  be  small.  c=l,  unless  the  y coordinate  is  stretched. ^ 

The  airfoil,  in  suitably  scaled  coordinates,  is  located  at  {y=0,  |x|  <_— }, 
and  we  consider  nonlifting  flows,  so  that  the  problem  domain  can,  by 
symmetry,  be  reduced  to  the  half-plane  {y>o},  with  boundary  conditions 

2 2 

(6.8)  <[>(x,y)  o as  x +y  -►  ® 

for  [ x f > j , 
for  | x | < j , 


(6.9)  $y(x,o) 


o , 

F'  (x)  , 


where  t F(x)  is  the  airfoil  thickness  function  which  we  took  to  be 
parabolic.  Equation  (6.7)  is  of  hyperbolic  or  elliptic  type  depending 
on  whether  K -2K$x  is  negative  or  positive  (supersonic  or  subsonic) . 

The  difference  equations  we  used  were  essentially  the  Murman's  con- 
servative scheme  ([9];  for  a recent  account  of  solution  methods,  see  [8]), 
where  the  main  idea  is  to  adaptively  use  upstream  differencing  in  the 
hyperbolic  region  and  central  differencing  in  the  elliptic  region,  keep- 
ing the  system  conservative.  For  relaxation  we  used  vertical  (y)  line 
relaxation,  marching  in  the  stream  direction.  The  multi-grid  solution 
was  programmed  both  in  the  CS  (Sec.  4)  and  the  FAS  (Sec.  5)  modes, 
with  practically  thg  ^ame  results.  We  used  cubic  interpolation  for  I ^ 
and  injection  for  I 


535 


Local  mode  analysis  of  the  linearized-freezed  difference  equations 
and  vertical- forward  line  relaxation  gives  the  smoothing  rate 


(6.10) 


y = max 


{ 


b++b  +ib_ 


2c+b 


*}■ 


b+(x)  = K 


2K 


at  elliptic  (subsonic)  points,  and  y = o at  supersonic  points.  We  were 
interested  in^cases  where  K<1  and  <p  >o,  and  hence,  in  smooth  elliptic 
regions  (b  'v  b ) without  coordinate  stretching  we  get  y 'u  1/ [ 2+i  j = 0.45 

. o -3/4  “ c 

and  y = y =0.55. 

The  actual  convergence  rates , observed  in  our  experiments  with  mod- 
erately supercritical  flows  (M  =0.7  and  M^  = 0.85,  t = 0.1)  on  a 
64x32  grid,  were  y = 0.52  to  0.53,  just  slightly  faster  than  the  theor- 
etical value.  (See  detailed  output  in  [12] . The  wojrk  count  in  [12 j is 
slightly  different,  counting  also  the  work  in  the  I transition). 

For  highly  supercritical  flows  (M^  = 0.95,  t = 0.1)  the  MG  convergence 
rate  deteriorated,  although  it  was  still  3 times  faster  than  solution  by 
line  relaxation  alone.  The  worse  convergence  pattern  is  explainable  in 
terms  of  the  mode  analysis  for  the  elliptic  region  immediately  behind 
the  shock,  where  b+  >>  b , yielding  y closer  to  1.  Also,  the  fast  changes 
in  <(>  in  that  region  gives  o > 1 (see  Sec.  A.l)  , i.e.,  the  coarse  grid 
cycles  actually  magnify  the  Fourier  component  with  0 = (—  , 0) , the  same 
component  for  which  y is  closer  to  1.  This  worse  behavior  in  this  re- 
stricted region  further  affected  our  computations  because  we  did  not  use 
separate  stopping  tests  for  this  region  as  we  should  (see  Sec.  A. 6).  A 
correct  multi-grid  algorithm  for  this  problem  should,  therefore,  include 
symmetric  selective  line  relaxation  (see  Sec.  3.2),  or  partial  relaxation 
sweeps  (see  Sec.  A. 9) , or  both,  in  addition  to  residual  weighting  (Sec.  A. 4). 

Coordinate  stretching,  which  transforms  the  bounded  computational 
domain  to  the  full  half  plane,  gave  difference  equations  that  again 
exhibited  slow  multi-grid  convergence  rate.  This,  too,  is  explainable 
by  the  mode  analysis.  For  example,  in  the  regions  where  the  y coordinate 
is  highly  stretched,  c in  (6.7)  becomes  very  small  and  hence  y in  (6.10) 
is  close  to  1.  The  theoretical  remedies:  alternating-direction  line  re- 

laxations and  partial  relaxation  sweeps.  The  latter  was  tried  in  one 
simple  situation  (stretching  only  the  x coordinate) , and  indeed  restored 
the  convergence  rate  of  the  corresponding  unstretched  case. 


7.  NON-UNIFORM  GRIDS. 

Many  problems  require  very  different  resolution  in  different  parts  of 
their  domains.  Special  refinement  of  the  grid  is  required  near  singular 
points,  in  boundary  layers,  near  shocks,  and  so  on.  Coarse  grids  (with 
higher  approximation  order)  should  be  used  where  the  solution  is  smooth, 
or  in  subdomains  far  from  the  region  where  the  solution  is  accurately 


needed,  etc.  A general  method  for  locally  choosing  mesh-sizes  and  approx- 
imation orders  is  described  in  Sec.  8.  An  important  feature  of  the  method 
is  adaptivity;  the  grid  may  change  during  the  solution  process , adapting 
itself  to  the  evolving  solution.  In  this  section,  we  propose  a method  of 
organizing  non-uniform  grids  so  that  the  local  refinemegt  i|  highly  flexible 
The  main  idea  is  to  let  the  sequence  of  uniform  grids  G , Gi,  ...,  Gn  (cf. 
Sec.  2)  be  open-ended  and  non-coextensive  (i.e.,  finer  levels  may  be  intro- 
duced on  increasingly  narrower  subdomains  to  produce  higher  local  refinement 
and  coarser  levels  may  be  introduced  on  increasingly  wider  domains  to 
cover  unbounded  domains) , and,  furthermore,  to  let  each  of  the  finer  levels 
be  defined  in  terms  of  suitable  local  coordinates.  The  multi-grid  FAS 
process  remains  practically  as  before  (Sec.  5)  , with  similar  efficiency. 

Also  discussed  is  a method  which  employs  this  grid  organization  for 
"segmental  refinement",  a multi-grid  solution  process  with  substantially 
reduced  storage  requirement. 


7.1.  Organizing  Non-Uniform  Grids.  How  are  general  non-uniform 
grids  organized  for  actual  computations?  There  are  two  popular  approaches: 
One,  usually  used  with  the  finite  element  method,  is  to  keep  the  entire 
system  very  flexible,  allowing  each  grid  point  to  be  practically  anywhere. 
This  requires  a great  deal  of  bookkeeping:  grid-points'  locations  and 

pointers  to  neighbors  need  to  be  stored;  sweeping  over  the  grid  is  com- 
plicated; obtaining  the  coefficients  of  the  difference  equations  (or  the 
local  "stiffness")  may  require  lengthy  calculations,  especially  where 
the  grid  is  irregular;  and  these  calculations  should  be  repeated  each 
relaxation  sweep,  or  else  additional  memory  areas  should  be  allocated 
to  store  the  coefficients.  Also,  it  is  more  difficult  to  organize  a 
multi-grid  solution  on  a completely  general  grid  (see,  however,  Secs.  7.3 
and  A. 5),  and  complete  generality  is  not  necessary  for  obtaining  any 
desired  refinement  pattern. 

Another  approach  for  organizing  a non-uniform  grid  is  by  a 
coordinate  transformation,  with  a uniform  grid  being  used  over  the  trans- 
formed domain.  On  such  grids,  topologically  still  rectangular ,^t^e  multi- 
grid method  can  be  implemented  in  the  usual  way,  the  lines  of  G being 
every  other  line  of  G . Decisions  (stopping  criteria,  residual  weighting, 
relaxation  mode  and  relaxation  directions)  should  be  based  on  the  trans- 
formed difference  equations.  Very  often,  however,  coordinate  transformation 
does  not  offer  enough  flexibility.  A local  refinement  is  not  easy  to 
produce,  unless  it  is  a one-dimensional  refinement,  or  a tensor 
product  of  one-dimensional  refinements.  The  difficulties  are  enlarged 
in  adaptive  procedures,  where  it  should  be  inexpensive  to  change  local 
mesh-sizes  several  times  in  the  solution  process.  Moreover,  the  trans- 
formation usually  makes  the  difference  equation  much  more  complicated 
(requiring  additional  storage  for  keeping  coefficients,  or  additional 
work  in  recomputing  them  every  sweep) , especially  when  the  transformation 
does  become  sophisticated  (i.e.,  adaptive,  and  not  merely  a product  of 
one-dimensional  transformations) , and  in  particular  if  higher-order  approx- 
imations should  be  used  in  some  or  all  subdomains. 

Thus,  be  it  in  the  original  or  in  some  transformed  domain,  one  would 
like  to  have  a convenient  system  for  local  refinements,  with  minimal 
bookkeeping  and  efficient  methods  for  formulating  and  solving  difference 
equations.  The  following  system  is  proposed  (and  then  generalized, 

in  Secs.  7.3,  7.4) : 
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A non-uniform  grid  is  a union  of  uniform  sub-grids,  G , G , . . . ,G  , 
with  corresponding  mesh-sijs^j  hQ,  hlf...,h  . Usually  Itsh  =2:1  and 
every  other  grid  line  of  G is  a grid  line  of  G . Unlike  the  descrip- 
tion in  Sec.  2,  however,  the  sub-gjjds  are  not  necessarily  extended  over 
t|ie  same  domain.  The  domain  of  G may  be  only  part  of  the  domain  of 
G (but  not  vice  versa) . Thus  we  may  have  different  levels  of  refine- 
ment at  different  subdomains. 

For  problems  on  a bounded  domain  ft,  several  of  the  first  (the  coars- 
est) sub-grids  may  extend  to  the  entire  domain  ft.  That  is,  they  do  not 
serve  to  produce  different  levels  of  refinement,  but  they  are  kept  in 
the  system  fog  serving  in  the  multi-grid  process  of  solving  the  difference 
equations.  G should  be  coarse  enough  to  have  its  system  of  difference 
equations  relatively  inexpensive  to  solve  (i.e.,  requiring  legs  than 
0(£nk)  operations,  where  n^  is  the  number  of  grid  points  in  G*.  But  cf. 

Sec.  4.1).  The  finer  sub-grids  typically  extend  only  gver  certain  sub- 
domains  of  ft,  not  necessarily  connected.  Generally,  G is  stretched  over 
those  subdomains  where  the  desired  mesh-size  is  h^  or  less.  Thus,  very 

__ 

fine  levels  (e.g.,  with  M=20,  so  that  h^=2  hQ)  may  be  introduced,  provided 
they  are  limited  to  suitably  small  subdomains. 

Such  a system  is  very  flexible,  since  grid  refinement  (or  coarsening) 
is  done  by  extending  (or  contracting)  uniform  sub-grids.  There  are  several 
possible  ways  of  storing  functions  on  a (possibly  disconnected)  uniform 
grid,  allowing  for  easy  grid  changes.  For  example,  each  string  (i.e.,  con- 
nected row  or  column)  of  function  values  can  be  stored  separately,  at  an 
arbitrary  place  in  one  big  storing  area,  with  a certain  system  of  pointers 
leading  from  one  string  to  the  next.  The  extra  storage  area  needed  for 
these  pointers  is  small  compared  with  the  area  needed  for  storing  the 
function  values  themselves.  One  such  system,  with  subroutines  for 
creating,  changing  and  interpolating  between  the  grids,  is  now  under 
construction,  and  will  be  reported  elsewhere. 

If  the  (original  or  transformed)  problem's  domain  is  unbounded,  we 
usually  put  suitable  boundary  conditions  on  some  finite,  "far  enough" 
artificial  boundary.  In  the  present  system,  we  do  not  have  to  decide 
in  advance  where  to  place  the  artificial  boundary:  We  can  extend  (or 

contract)  the  coarsest  sub-grid(s)  as  the  jolutj^on  evolves.  Moreover, 
we  can  add  increasingly  coarser  levels  (G  , G , . . . ) to  cover  increasingly 
wider  domains,  if  required  by  the  evolving  solution.  In  this  way,  we  may 
reach  computational  domains  of  large  diameter  R,  by  adding  only  0(log  R) 
grid  points  (assuming  the  desired  mesh-size,  out  at  distance  r,  is  pro- 
portional to  r,  or  larger.  This  should  usually  be  the  case,  especially 
if  appropriate  higher-order  approximations  are  used  at  large  distances) . 

There  appears  to  be  a certain  waste  in  the  proposed  system,  as  one 
function  value  may  £e  stored  several  times,  when  its  grid  point  belongs 
to  several  levels  G . This  is  not  the  case.^  First,  because  the  amount 
of  such  extra  storage  is  small  (less  than  2 of  the  total  storage;  see 
(4.6)).  Moreover,  the  stored  values  are  exactly  those  needed  for  the 
multi-grid  process  of  solution:  In  fact,  in  that  process,  the  values 

stored  for  different  levels  at  the  same  grid-point  are  not  identical, 
they  only  converge  to  the  same  value  as  the  process  proceeds. 
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7.2.  The  Multi-Grid  Algorithm  on  Non-Uniform  Grids.  The  following 
is  a description  of  the  modification  in  the  FAS  multi-grid  algorithm  (Sec.  5) 
in  case  of  a non-uniform  grid  with  the  above  structure.  The  algorithm 
remains  almost  the  same,  except  that  the  difference  equations  (5.2-4) 
are  changed  to  take  account  of  the  fact  that  the  Revels  G do  not 
necessarily  cover  the  same  domain.  Denoting  by  the  set  of  points  of 

Gk  which  are  inner  points  of  a firmer  level  Gm  (i.e.,  points  where  the  G™ 
difference  equations  are  defined  ; see  Figure  4) , the  modified  form  of 
the  difference  equations  on  G is 


(7.1) 
where 

(7.2) 

(7.3) 

(7.4) 

(7.5) 


r k k -k 
L u = F , 


Fk  = Fk 


and 


Fk  = F k and 
k+1 


.k  k rk 
A u = $ , 


*k  . k k _ k , , . 

$ = <J>  m G -G,  , and  for  k=M, 

k+1 

t k k _ k 

* = *k+l  ln  Gk+1  ' 


„k  Tk  ,-m  m m k.k  m. 

F =1  (F  - L u ) + L (I  u ) , 
mm  m 

. k k .Tm  .m  m ,k._k  m. 

m in  m 


F and  $ , as  in  Sec.  2,  are  the  G approximation  to  the  original  right- 
hand  sides  F and  $,  respectively. 

Observe  that,  by  (7.2-3),  each  intermediate  lj|yel  G plays  a doubly 
role:  on  the  subdomain  where  the  finer  sub-grid  G is  not  defined,  G 

plays  the  role  of  the  finest  grid  and  the  difference  equation  there  is  an 
approximation  to  the  original  differential  equation.  ^At  the  same  time, 
on  the  subdomain  where  finer  sub-grids  are  present,  G serves  for 
calculating  the  coarse-grid  correction.  These  t^o  roles  are  not  confused 
owing  to  the  FAS  mode,  in  which  the  correction  v is  only  implicitly 
computed,  its  equation  being  actually  written  in  terms  of  the  jEull  approx- 
imation uk.  In  other  words,  F may  be^regarded  as  the  usjual  G right- 
hand  side  (F) , corrected  to  achieve  G accuracy  in  the  G solution. 


Indeed 

(7.6) 

which  is 


_k  ,k  -m  Tk.  k m Tk/Trn„m\ 

F - I F = L (I  u ) - I (L  u ) , 
mm  mm 

m k 

the  G approximation  to  the  G truncation  error. 


3 We  use  the  term  "inner",  and  not  "interior",  because  these  points  may 
well  be  boundary  points.  Indeed,  at  boundary  points  difference  equations 
are  defined,  although  they  are  of  a special  type,  called  boundarv  con- 
ditions. The  only  Gm  points  where  G™  difference  equations  are  not  de- 
fined are  points  on  or  near  the  internal  boundary  of  G ; i.e.  , the  boundary 
beyond  which  the  level  G is  not  defined,  but  some  coarser  levels  are.  If 
the  grid-lines  of  Gk  do  ngt  coincide  with  grid  lines  of  G , G^  is  defined 
as  the  set  of  points  of  G to  whicji  proper  interpolation  from  inner  points 

of  G is  well-defined.  For  m>M,  G is  empty. 

m 
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Figure  4.  Example  of  Non-Uniform  Grid. 

A section  of  the  domain  ft  and  its  boundary  3ft  is  shown, 

covered  with  a coarser  grid  G (line  intersections)  and  a 
k+1 

finer  grid  G (crosses  and  circles) . For  the  case  of 

k+1 

a 5-points  (or  9-points  "box")  difference  equations,  G 
inner  points  are  marked  with  crosses,  its  outer  points  with 

circles.  (For  convenient  interpolation,  outer  points  should 

k k 

lie  on  G lines) . At  outer  points  belonging  to  G , the 

k 

converged  solution  satisfies  the  G difference  equations, 
such  as  the  5-point  relations  indicated  by  squtres.  At 
other  outer  points,  such  as  those  shown  with  triangles,  the 
solution  is  always  an  interpolation  from  values  at  adjacent 
G points.  (Note  that  starting  values  at  outer  points  should 
be  such  that  these  interpolation  relations  are  satisfied.  The 
FAS  interpolation  steps  will  then  automatically  preserve  these 
relations . ) 


The  only  other  modification  required  in  applying  Cycle  C 
to  non-uniform  grids  is  in  the  convergence  switching  criteria. 
See  Sec.  A. 10. 


When  converged,  tjie  solution  so  obtained  satisfies  equations  (2.2) 
in  the  inner  part  of  G*  - G*  , (k=0 , 1, . . . ,M)  . On  outer  (i.e.,  non- 
inner)  points  the  solution  automatically  satisfies  either  a coarser-grid 
difference  equation  (if  the  point  belongs  to  a coarser  grid)  or  a coarser 
grid  interpolation  relation.  (See  Figure  4).  Note  that,  in  this  procedure, 
difference  equations  should  be  defined  on  uniform  grids  only.  This  is  an 
important  advantage.  Difference  equations  on  equi-distant  points  are 
much  simpler,  more  accurate.  The  basic  weights  for  eac^  tejjm  (e.g.,  the 
weights  (1,2,1)  for  the  secord  order  approximation  to  3 /3x  ) can  be  read 
from  small  standard  tables;  whereas  on  a general  grid  those  weights 
should  be  recomputed  (or  stored)  separately  for  each  point,  and  they  are 
very  complicated  for  high-order  approximations. 

Another  advantage  is  that  the  relaxation  sweeps,  too,  are  on  uniform 
grids  only.  This  simplifies  the  sweeping,  and  is  particularly  important 
where  symmetric  and  alternating-direction  sweeps  are  required  (cf . Sec.  3) . 

Numerical  experiments  indicate  that  the  typical  multi-grid  convergence 
rates,  measured  by  the  overall  error  reduction  per  work  unit  and  pre- 
dicted by  local  mode  analysis  (cf.  Sec.  6) , are  retained  in  multi-grid 
solutions  on  non-uniform  grids.  The  work  unit,  though,  is  somewhat 
different:  1^  is  the  computational  work  of  one  sweep  on  all  levels, 

not  only  on  G , since  here  G may  make  up  only  a small  part  of  the  points 
of  the  final  non-uniform  grid. 

7.3.  Finite-Elements  Generalization.  The  structure  and  solution 
process  outlined  above  can  be  generalized  in  various  ways.  An  im- 
portant generalization  is  to  employ  piece-wise  uniform,  rather  than 
strictly  uniform,  levels. 

Quite  often,  especially  in  problemsothat  use  finite-elements 
discretizations,  the  "basic"  partition  G (e.g.,  the  coarsest  triangulation) 
of  the  domain  ic  a non-uniform  one,  but  one  whidji  i^  particularly  suitable 
for  the  geometry  of  the  problem.  Finer  levels  G ,G  , .j..,  are  defined  as 
uniform  refinements  of  that  basic  level;  e.g.,  h^  = 2 h^;  so  that  h^ 

is  constant  within  each  basic  element. 

Jc  • 

Having  defined  the  levels  G in  this  manner,  the  rest  may  m 

principle  be  as  before:  The  actual  composite  grid  may  use  onjLy  certain, 

arbitrary  portions  of  each  level;  i.e.,  the  actual  subgrids  G need  not 
be  co-extensive,  allowing  for  adaptive  refinements.  Coarser  levels 
(G  , G ...,)  may  be  added  if  the  basic  level  G is  not  coarse  enough 
for  full-speed  multi-grid  solution.Q  (Although  therg  is  no  general  algor- 
ithm for  coarsening  a non-uniform  G , and  usually  G is  coarse  enough) . 
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Data  structures,  similar  to  the  uniform  case  may  be  used,  but  should  be 
constructed  separately  for  each  basic  element  (or  each  set  of  identical 
basic  elements) . 

The  multi-grid  algorithm  is  the  same  as  in  Sec.  7.2.  The  discrete 
equations  are  thus  defined  separately  for  each  level.  The  reproduction 
of  these  equations  during  relaxation  is  not  as  convenient  as  in  the 
strictly  uniform  case,  but  still,  in  the  interior  of  any  basic 
element  the  equations  can  readily  be  read  from  fixed  tables, 
one  table  for  each  set  of  identical  basic  elements. 

7.4.  Local  Transformations.  Another  important  generalization  of 
tne  above  structure  is  to  subgrids  which  are  defined  each  in  terms  of 
another  set  of  variables.  For  example,  near  a boundary  or  an  interface, 
the  most  effective  local  discretizations  are  made  in  terms  of  local 
coordinates  in  which  the  boundary  (or  interface)  is  a coordinate  line. 

In  particular,  with  such  coordinates  it  is  easy  to  formulate  high-order 
approximations  near  the  boundary;  or  to  introduce  mesh  sizes  that  are 
different  across  and  along  the  interface  (or  the  boundary  layer) ; etc. 
Usually  it  is  easy  to  define  suitable  local  coordinates,  and  uniformly 
discretize  them,  but  it  is  more  difficult  to  patch  together  all  these 
local  discretizations. 

A multi-grid  method  for  patching  together  a collection  of  local  grids 
Gl'G2'‘‘"'Gm  ^each  be:i-ng  uniform  in  its  own  local  coordinates)  is  to  relate 
them  all  to  a basic  grid  , which  is  uniform  in  the  global  coordinates 
and  stretches  over  the  entire  domain.  The  relation  is  essentially  as  above 
(Sec.  7.2);  namely,  finite-difference  equations  are  separately  defined  in 
the  inner  points  of  each  grid,  and  the  FAS  multi-grid  process  auto- 
matically combines  them  together  through  its  usual  interpolation  periods. 

A remark:  To  a given  collection  of  local  grids  we  may  have  to  add 

intermediate  grids  to  obtain  fast  multi-grid  convergence.  That  is,  if  a 
given  local  grid  is  much  finer  than  the  basic  grid  G , we  have  to 
add  increasingly  coarser  grids,  all  of  them  uniform  grids  in  the  same 
local  coordinates,  such  that  the  coarsest  of  them  has  a mesh  size  which 
is  (in  the  global  coordinates)  nowhere  much  smaller  than  the  basic  mesh 
size  h . Similarly,  if  the  basic  global  grid  G is  not  coarse  enough, 

w ^ 0 1 M 

the  usual  multi-grid  sequence  of  global  grids  G , G ,...,G  = G^  should 

be  introduced.  Thus,  in  each  set  of  coordinates  we  will  generally  have 
several  grids. 

Such  a system  offers  much  flexibility.  Precise  treatment  of 
boundaries  and  interfaces  by  the  global  coordinates  is  not  required. 

The  local  coordinates  may  be  changed  in  course  of  computations,  e.g.,  to 
fit  a moving  interface.  New  sets  of  local  coordinates  may  be  introduced 
(or  deleted)  as  the  need  arises. 

The  data  structure  required  for  creating,  changing  and  employing 
such  grids  is  basically  again  just  any  data  structure  suitable  for 
changeable  uniform  grids.  This,  however,  should  be  supplemented  by 
tables  for  the  local  transformations,  such  that  one  can  efficiently  (i) 
reproduce  the  local  difference  equation,  and  (ii)  interpolate  from  local 
to  global  grid  points,  and  vice-versa. 
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7.5.  Segmental  Refinement.  The  multi-grid  algorithm  for  non-uniform 
grids  (Sec.  7.2)  can  be  useful  even  in  the  case  of  uniform  grids,  if  the 
computer  memory  is  not  sufficiently  large  to  store  the  finer  levels. 

"Segmental  refinement"  is  the  refinement  of  one  subdomain  at  a time. 

To  see  why  and  how  j^his  is  possible,  observe  that  with  the  FAS  mode  (Sec.  5) 
the  full  solution  u is  obtained  on  all  grids.  But  on  a coarser  grid  G*, 

^ • “)C  If 

the  u solution  satisfies  "corrected"  difference  equation,  with  F = Fjvj 
replacing  F . It  is  therefore  not  necessary  to  keep  the  fine  grid,  once 
F^  has  been  computed. 

The  corrected  forcing  function  FM  can  be  computed  by  segmental  refine- 
ment. Refining  only  one  subdomain,  one  can  use  the  algorithm  a^ove  (Sec. 
7.2)  to  obtain  a multi-grid  solution,  including  tjie  values  of  F^  in  the 
refined  subdomain.  Keeping  this  FM  (instead  of  F ) , one  can  then  discard 
this  refinement,  and  refine  a second  subdomain.  And  so  on,  through  a 
sequence  of  subdomains  covering  the  entire  domain. 

Since  subsequent  subdomain  refinements  change  t£ie  solution  everywhere, 
some  further  changes  are  also  due  in  the  values  of  FM  on  former  subdomains. 
However,  at  points  inner  to  (and  few  meshes  away  from  the  boundary  of) 
such  a former  subdgma^n,  these  further  changes  are  much  smaller  them  the 
first  correction  FM~F  , since  they  represent  changes  in  the  G truncation 
error  due  to  small  smooth  changes  in  the  solution,  while  the  first  cor- 
rection represents  the  full  G truncation  error.  Thus,  if  the  refinement 
segments  are  chosen  so  that  neighboring  segments  overlap  (several  mesh 
intervals  into  each  other) , then  the  further  corrections  may  be  ignored. 

If  extra  accuracy  is  desired,  another  cycle  of  segmental  refinements 
may  be  performed.  Another  way  of  viewing  this  technique  is  to  observe 
that  the  roll  of  the  finer  levels,  relative  to  the  coarser  ones,  is  only 
to  liquidate  high-frequency  error  components  which  cannot  be  "seen"  on 
the  coarser  grids.  These  components  have  a short  (just  few  mesh  sizes) 
coupling  range,  and  can  therefore  be ■ computed  at  any  point  by  refining 
only  few  neighboring  meshes. 

With  this  technique  one  can  operate  the  multi-grid  algorithm  almost  in 
its  full  efficiency,  using  a storage  area  which  is  much  smaller  than  that  of 
the  finest  grid.  This  has  been  confirmed  by  preliminary  (one-dimensional) 
numerical  tests. 

In  principlg,  the  required  storage  area  can  be  reduced  to  only  a con- 
stant cube,  of  J locations,  on  each  level  (where  even  J=20  probably 
offer  enough  overlap  without  substantial  reduction  in  efficiency).  Thus, 
the  overall  storage  requirement  can  in  principle  be  reduced  to 

Jd  {1  + log  £ / log  J} 
n 

locations , where  h is  the  finest  mesh-size  and  R is  the  diameter  of  the 
domain.  No  external  memory  is  needed. 
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8.  ADAPTIVE  DISCRETIZATION  TECHNIQUES . 

The  previous  section  described  a flexible  data  structure  and  solution 
process  which  facilitate  implementation  of  variable  mesh-sizes  h.  The 
difference  equations  in  that  process  are  always  defined  at  inner  points  of 
uniform  subgrids,  which  make  it  easy  to  employ  high  and  variable  approximation 
orders  p.  How,  then,  mesh-sizes  and  approximation-orders  are  to  be  chosen? 
Should  boundary  layers,  for  examples,  be  resolved  by  the  grid?  What  is 
their  proper  resolution?  Should  we  use  high-order  of  approximation  at  such 
layers?  How  to  detect  such  layers  automatical ly?  In  this  section  we  pro- 
pose a general  framework  for  automatic  selection  of  h and  p in  a (nearly) 
optimal  way.  In  the  next  (Sec.  9)  we  will  study  some  special  cases,  and 
show  how  this  proposed  system  automatically  resolve  or  avoid  from  resolving 
thin  layer,  depending  on  the  alleged  goal  of  the  computations. 

8.1  Basic  Principles.  We  will  treat  the  problem  of  selecting  the 
discretization  parameters  h and  p (and  possibly  other  parameters,  see 
Sec.  8.4)  as  an  optimization  problem:  We  will  seek  to  minimize  a certain 

error  estimator  E,  subject  to  a given  amount  of  computational  work  W. 

(Or,  equivalently,  minimize  the  work  W to  obtain  a given  level  E of 
the  error  estimator.  We  will  see  that  the  actual  control  quantity  is 
neither  E nor  W,  but  their  rate  of  exchange.)  It  is  important,  however, 
to  promptly  emphasize  that  we  should  not  take  this  optimization  too 
pedantically  it  is  enough,  for  instance,  to  obtain  E which  is  one  or  two 
orders  of  magnitudes  larger  than  the  minimum  (or,  equivalently,  to  invest 
work  W which  is  by  some  fraction  more  than  theoretically  needed.  Note 
below  that  log (1/E  . ) is  usually  proportional  to  W) . Full  optimization 
is  not  our  purpose1?1 is  enormously  harder  and,  in  fact,  is  self-defeating, 
since  it  requires  too  much  computational  work  to  be  invested  in  controling 
h and  p.  We  will  aim  at  having  the  control  work  much  smaller  than  the 
actual  numerical  work  W,  using  the  optimization  problem  only  as  a loose 
directive  for  sensible  discretization. 

The  Error  Estimator  E is  a functional  that  estimates  the  overall  error 
in  solving  the  differential  boundary-value  problem,  in  terms  of  any  given 
numerical  approximation.  In  principle,  such  a functional  should  be 
furnished  whenever  a problem  is  submitted  for  numerical  solution;  in 
practice,  it  is  seldom  provided.  To  have  such  an  estimator  depends  on 
having  a clear  and  well-defined  idea  about  the  goal  of  the  computations , 
i.e.,  an  idea  about  what  error  norm  we  intend  to  minimize.  Given  the  goal, 
even  roughly,  we  can  usually  formulate  E quite  easily.  We  assume  that 
the  numerical  approximation  U is  in  some  suitable  neighborhood  of  the 
true  solution  (this  is  a necessary  and  justifiable  assumption;  see  Sec. 

8.2),  so  that  E can  be  written  as  a linear  functional 

(8.1)  E = J G(x)  T(x)  dx  . 

n 

x(x)  is  a local  estimate  of  the  truncation  error  i.e.,  the  error  by  which 
the  numerical  solution  u fails  to  satisfy  the  differential  equation  LU=F; 
or  more  conveniently,  the  error  by  which^t|ie  differential  solution  U 
fails  to  satisfy  the  discrete  equation  L U =F.  That  is. 
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(8.2) 


t(x) 


= |LU(x)  - L^fx)  | 


G(x)  is  the  non-negative  "error-weighting  function"  (or  distribution) , 
through  which  the  computations  goal  should  be  expressed. 

The  choice  of  G can  be  crude.  In  fact,  multiplying  G by  a con- 
stant does  not  change  our  optimization  problem.  Also,  we  can  make  large 
errors,  up  to  one  or  two  orders  of  magnitudes,  in  the  relative  values 
of  G at  two  different  points,  since  we  are  content  in  having  E only  to  that 
accuracy.  What  matter  are  only  large  changes  in  G,  e.g.,  near  boundaries. 
For  example,  if  we  have  a uniformly  elliptic  problem  of  order  m,  and  if 
we  are  interested  in  computing  good  approximations  to  U and  its  derivatives 
up  to  order  l and  upto  the  boundary,  then  a suitable  choice  is 


(8.3) 


G (x) 


m/2- 1 
a 

x 


where  d is  the  distance  of  x from  the  boundary.  (The  formula  should  be 
suitably  modified  near  a boundary  corner) . This  and  similar  choices  of 
G are  easily  found  by  local  one-dimensional  crude  analysis  of  the  relation 
between  a perturbation  in  the  equations  and  the  resulting  perturbation 
in  the  quantity  we  wish  to  approximate.  Even  though  crude,  such  choice  of 
would  specify  our  goal  much  closer  than  people  usually  bother  to.  More- 
over, we  can  change  G if  we  learn  that  it  fails  to  properly  weigh  a certain 
region  of  the  computation;  it  can  serve  as  a convenient  control,  conveying 
our  intentions  to  the  numerical  discretization  and  solution. 


The  Work  Functional  W.  In  solving  the  discrete  equations  by  the  multi- 
grid method,  the  main  overall  computational  work  is  the  number  of  Work 
Units  invested  in  relaxations,  times  the  amount  of  confutations  in  each 
Work  Unit  (see  Sec.  6) . If  the  discretization  and  relaxation  schemes 
are  suitable,  the  number  of  Work  Units  is  almost  independen^yf  the 
relaxation  parameters  h and  p.  (See  e.g.,  the  rate  y for  A^  vs.  A^ 
in  Table  1 above) . Since  for  our  optimization  problem  we  need  W only 
up  to  a multiplicative  constant,  we  can  take  into  account  only  the  amount 
of  computations  in  a single  Work  Unit,  i.e.,  the  work  in  one  relaxation 
sweep  oygr  the  domain.  The  local  number  of  grid  points  per  unit  volume 
is  h(x)  , and  the  amount  of  computation  at  each  grid  point  is  a function 
w(p(x)) , where  p(x)  is  the  local  order  of  approximation.  Hence,  we  can 
regard  the  work  functional  as  being 


Global  Optimization  Equations.  Treating  the  discretization  parameters 
as  spatial  variables,  h(x)  and  p(x) , the  Euler  equations  of  minimizing  E 
for  fixed  W are 


(8.5a) 


(8.5b) 


3E 

. x 3W 

3h  (x) 

3h(x) 

3E 

. x 3W 

3p(x) 

3p(x) 
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where  X is  a constant  (the  Lagrange  multiplier).  It  is 
X is  actually  the  marginal  rate  of  exchange  between  work 


accuracy,  i.e. 


(8.6) 


X = 


dE  . 
min 

dW 


= E 


lQ<3  I 

dW 


easily  seen  that 
and  optimal 


and  the  meaning  of  (8.5)  is  that  we  cannot  lower  E by  trading  work  (e.g., 
by  taking  smaller  h at  one  point  and  larger  at  another,  keeping  W 
constant,  or  trading  a change  in  h with  a change  in  p) . 


Equations  (8.5)  make  some  essential  simplifications  in  the 
optimization  problem:  They  regard  h and  p as  defined  at  all  points 

xcft;  Also,  h and  p are  assumed  to  be  continuous  variables,  whereas  in 
practice  they  are  discrete.  (p  should  be  a positive  integer,  in  some 
schemes  a positive  even  integer.  Values  of  h are  restricted  by  some 
grid-organization  considerations.)  These  simplifications  are  crucial  for 
our  approach,  and  they  are  altogether  justified  by  the  fact  that  we  are 
content  in  having  only  an  approximate  optimum.  The  practical  aspect,  of 
choosing  permissible  h and  p close  to  the  solution  of  (8.5),  is  discussed  in 
Sec.  (8.3).  One  restriction  we  should,  however,  take  into  account  in 
the  basic  equations,  namely,  the  restriction 


(8.7)  PQ  i.  p(x)  £ P1(*) • 


Without  such  a restriction,  the  optimization  equations  may  give  values 
of  p which  cannot  be  approximated  by  permissible  values.  pQ  is  usually 
1 or  (in  symmetric  schemes)  2.  The  upper  bound  p^  may  express  the  high- 
est feasible  order  due  to  round-off  errors;  or  the  highest  order  for  which 
we  actually  have  appropriate  (stable)  discretization  formulae,  with  special 
such  restriction  near  boundaries  (hence  the  possible  dependence  of  p on 
the  position  x) . With  this  restriction,  Euler  equation  (8.5b)  should  be 
rewritten  as 


(8.8) 


BE 

Bp(x) 


+ X 


Bw 

Bp  (x) 


1 °' 

if 

P(x)  = PQ 

= O, 

if 

PQ  < P(x)  < Pj_ 

(x) 

<_  o, 

if 

p ( x)  = p (x)  . 

Substituting 

(8.1)  and  (8.4) 

into 

(8.5a)  and  (8.8),  we  get  the  following  equations  at  each  point  x e ft: 


(8.9a) 


8.9b) 


3t  Xd  w (p) 

Bh  “ , d+1 

n 


0 
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where  the  equality-inequality  sign,  in  (8.9b)  and  hereinafter,  correspond? 
to  the  three  cases  introduced  in  (8.8).  In  principle,  the  pair  of 
equations  (8.9)  determines,  for  each  x e ft,  the  local  values  of  the  pair 
(h,p) , once  X is  given. 

Thus  X is  our  global  control  parameter.  Choosing  larger  X,  we  will 
get  an  optimized  grid  with  less  work  and  poorer  accuracy;  lowering  X,  we 
invest  more  work  and  get  higher  accuracy.  For  each  X,  however,  we 
get  (approximately)  the  highest  accuracy  for  the  work  invested.  In 
principle  X should  be  given  by  whoever  submits  the  problem  for  numerical 
solution;  i.e.,  he  should  tell  at  what  rate  of  exchange  he  is  willing 
to  invest  computational  work  for  additional  accuracy  (see  (8.5)).  In 
practice  this  is  not  done,  and  X usually  serves  as  a convenient  control 
parameter  (see  Secs.  8.2  and  8.3). 

To  compute  h and  p from  (8.9)  we  should  know  the  behavior  of  t as  a 
function  of  h and  p.  Generally, 

(8.10)  t (x,h ,p)  ^ t ( x , p)  hP, 

where  t(x,p)  depends  on  the  equations  and  on  the  solution.  Since  it  is 
assumed  that  all  our  numerical  approximations  are  in  some  neighborhood 
of  the  solution  (see  Sec.  8.2),  we  may  assume  that  the  truncation-error 
estimates,  automatically  calculated  by  the  multi-grid  processing  (see  (5.7), 
for  example) , give  us  local  estimates  for  t(x,p) . In  practice,  we  never 
actually  solve  (8.9) , but  use  these  relations  to  decide  upon  changes  in 
h and  p (see  Sec.  8.3),  so  that  we  need  to  estimate  x(x,h,p)  only  for 
h and  p close  to  the  current  h(x)  and  p(x). 

8.2.  Continuation  Methods.  Continuation  methods  are  generally  used 
in  numerical  solutions  of  nonlinear  boundary  value  problems.  A certain 
problem-parameter , y say,  is  introduced,  so  that  instead  of  a single 
isolated  problem  we  consider  a continuum  of  problems,  one  problem  P(y) 
for  each  value  of  y in  an  interval  y < yt,  where  P(y  ) is  easily 

solvable  (e.g.,  it  is  linear),  and  P?yJ  is  the  target  (the  given)  pro- 
blem. The  continuation  method  of  solution  is  to  advance  y from  y^  to  yA 
in  steps  6y.  At  each  step  we  use  the  final  solution  of  the  previous  step 
(or  extrapolation  from  several  previous  steps)  as  a first  approximation  in 
an  iterative  process  for  solving  P(y).  The  main  purpose  of  such  con- 
tinuation procedures  is  to  ensure  that  the  approximations  we  use  in 
the  iterative  process  are  always  "close  enough"  to  the  solution  (of  the 
current  P (y ) ) , so  that  some  desirable  properties  are  maintained.  Usually 
y is  some  natural  physical  parameter  (the  Reynolds  number,  the  Mach  number, 
etc.)  in  terms  of  which  either  the  differential  equations  or  the  boundary 
conditions,  or  both,  are  expressed. 

The  continuation  process  is  not  a waste,  for  several  reasons.  In 
many  cases,  the  intermediate  problems  P(y)  are  interesting  by  themselves, 
since  they  correspond  to  a sequence  of  cases  of  the  same  physical  problem. 
More  importantly,  in  solving  non-linear  discretized  problems  the  continua- 
tion process  is  not  only  a method  of  computing  the  solution,  but  also,  in 
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effect,  the  only  way  to  define  the  solution,  i.e.,  the  way  to  select  one 
out  of  the  many  solutions  of  the  non-linear  algebraic  system.  The  desired 
solution  is  defined  as  the  one  which  is  obtained  by  continuous  mapping 
from  [yqf  Y*]  to  the  solution  space  with  a given  solution  at  y (e.g., 
the  single  solution,  if  P(y  ) is  linear).  By  the  continuation  process, 
we  keep  every  intermediate  numerical  solution  in  the  vicinity  of  a 
physical  solution  (to  an  intermediate  problem),  hence  the  target  numerical 
solution  is,  hopefully,  near  the  target  physical  solution,  and  is  not 
some  spurious  solution  of  the  algebraic  system.  Thus,  although  sometimes 
we  may  get  away  without  a continuation  process  (singly  because  a starting 
solution  is  "close  enough",  so  that  the  continuation  may  be  done  in  just 
one  step) , in  principle  a continuation  process  must  be  present  in  any 
numerical  solution  of  non-linear  problems.  Moreover,  such  a process  is 
usually  inexpensive,  since  it  can  be  done  with  crude  accuracy,  so 
that  its  intermediate  steps  usually  total  less  computational  work  than 
the  final  step  of  computing  an  accurate  solution  to  P(y*) • 

A continuation  process  is  necessary,  in  principle,  not  only  for  non- 
linear problems,  but  also  for  linear  problems  with  grid  adaptation.  In 
fact,  when  h or  p are  themselves  unknown,  the  discrete  problem  is 
nonlinear,  even  if  the  differential  problem  is  linear. 

In  our  system,  a continuation  process  with  crude  accuracy  and 
little  work  is  automatically  obtained  by  selecting  a large  value  for  the 
control  parameter  X (cf.  Sec.  8.1).  Then,  in  the  final  step  <Y=Y*)  » ^ is 
decreased  to  refine  the  solution.  Thus,  the  overall  process  may  be 
viewed  as  a multi-grid  process  of  solution,  controlled  by  the  two  para- 
eters  y and  1. 

.Tne  most  efficient  way  of  changing  y is  probably  to  change  it  as 
soon  as  possible  (e.g.,  when  the  multi-grid  processing  exhibits  convergence 
to  a crude  tolerance)  and  to  control  the  step-size  6y  by  some  automatic 
procedure,  so  that  6y  is  sharply  decreased  when  divergence  is  sensed  (in 
the  multi-grid  processing) , and  slowly  increased  otherwise. 

In  changing  y it  is  advisable  to  keep  the  residuals  as  smooth  as 
possible,  since  higher  frequency  components  are  more  expensive  to  liquidate 
(lower  components  being  liquidated  on  coarser  grids) . Thus,  for  example, 
if  a boundary  condition  should  be  changed  while  changing  y,  it  is  advisable 
to  introduce  this  change  into  the  system  at  a stage  when  the 
algorithm  is  to  start  working  on  the  coarsest  grid. 

y-Extrapolation.  In  some  cases  the  given  problem  (Y=Y*)  is  much  too 
difficult  to  solve,  e.g.,  because  the  differential  solution  fluctuates 
on  a scale  too  fine  to  be  resolved.  In  such  cases  one  is  normally  not 
interested  in  the  details  of  the  solution  but  rather  in  a certain 
functional  of  the  solution.  It  is  sometimes  possible  in  such  cases  to 
solve  the  problem  for  certain  values  of  y far  from  y*,  and  to  extrapolate 
the  corresponding  functional  values  to  Y=Y*  • 
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8.3.  Practice  of  Discretization  Control.  The  main  practical  re- 
strictions imposed  on  the  theoretical  discretization  equations  (8.9)  are 
the  following:  The  approximation  order  p should  be  a positive  integer. 

In  many  problems  p is  also  restricted  to  be  even,  since  odd  orders  are 
less  efficient.  The  mesh-size  function  h(x)  should  be  such  that  a reasonable 
grid  can  be  constructed  with  it.  Thus,  in  the  gg id  structure  outlined  in 
Sec.  7.1,  h is  restricted  to  be  of  the  form  h=2  h , where  k is  an  integer. 
Also,  in  the  multi-grid  discretization  method  outlined  in  Sec.  7.2,  any 
uniform  subgrid  truly  influences  the  global  solution  only  if  it  is  large 
enough,  i.e.,  if  at  least  some  of  its  inner  points  belong  also  to  coarser 
grids.  These  discretization  restrictions  will  actually  help  us  in  meeting 
another  practical  requirement,  namely,  the  need  to  keep  the  control-work 
(computer  work  invested  in  testing  for  and  affecting  discretization  refor- 
mulations) small  compared  with  the  numerical  work  (relaxation  sweeps  and 
interpolations) . 

The  practical  adaptive  procedure  is  proposed  to  be  generally  along 
the  following  lines: 

A.  Testing.  In  the  multi-grid  solution  process  (possibly  incorporating 
a continuation  process) , at  some  natural  point  we  get  an  estimate  of  the 
decrease  in  the  error  estimator  E introduced  by  the  present  discretization 
parameters.  For  example,  in  FAS  Cycle  C (see  its  flowchart  in  Fig.  2),  at 
the  point  where  new  F is  computed,  the  quantity 


(8.11) 
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at  each  point  may  serve  as  a local  estimate  for  the  decrease  in  E per 
unit  volume  (cf.  (8.1)  and  (5.7))  , owing  to  the  refinement  from  h^  to  h^+^. 
Each  such  decrease  in  E is  related  to  some  additional  work  AW  (per 
unit  volume) 
the  additional  work 


For  example,  the  refinement  from  to  h^+1  requires 


(8.12) 


AW  = ^ 
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(per  unit  volume) 


Hence  we  compute  the  ratio  of  exchanging  accuracy  per  work  Q = - AE  / AW. 

At  regions  where  this  ratio  is  much  bigger  than  X (the  control  rate  of 
exchange;  cf.  Sec.  8.1)  we  say  that  the  present  parameter  (h^+1  in  the 
example)  is  highly  profitable  and  itisworth  trying  jt^further  refine  the 
discretization  (e.g.,  introduce  there  the  subgrid  G with  h = h]t+1  /2)  . 
At  regions  whejC|^Q  is  much  smaller  than  X we  may  coarsen  the  discretization 
(abolish  the  G*  1 subgrid) . 

Extrapolated  tests . More  sophisticated  tests  may  be  based  on  assuming 
the  truncation  error  to  have  some  form  of  dependence  on  h and  p,  such  as 
(8.10)  above.  Instead  of  using  AE  and  AW  at  the  previous  change  (from 

h to  h , in  the  above  example)  we  can  then  anticipate  the  corresponding 
k k+1  

values  AE  and  AW  at  the  next  change  ( from  h^+^  to  h^^)  • which  are 

the  more  appropriate  values  in  testing  whether  to  make  that  next  change. 


4* 


\ 


549 


t 


Thus,  in  the  above  example,  assuming  (8.10)  and  = h^+1/2  = h^/4,  we 

get  AE  = 2~P  AE  , AW  = 2d  AW,  and  hence 


(8.13) 


, d | -k  k -k+1  | 
q = zM.  = 2-p_d  Q = -h,C+1  G F ~ lk+1  F 

AW 


w(p)  (2d  -1)  2P 


The  extrapolated  ratio  Q is  used  in  testing  for  grid  changes.  This  may 
seem  risky,  since  it  depends  on  assuming  (8.10) . But  in  fact  there  is 
no  such  risk,  because  we  can  see  from  (8.13)  that  testing  with  Q is  not 
that  much  different  from  testing  with  Q.  (In  fact,  if  p is  constant, 
testing  with  Q is  equivalent  to  testing  with  Q against  another  constant  X.) 
Anj3+jhe  test  with  Q does  not  presume  (8.10);  it  only  assumes  that  the  finer 
(G  ) approximation  is  considerably  better  than  the  coarser  one,  so  that 
their  difference  roughly  corresponds  to  an  added  accuracy  due  to  the 
refinement.  Note  also  that  the  multi-grid  stopping  criteria 
((A. 17)  or  (A. 20)  in  App.  A)  are  precisely  such  that  Q can 
be  reliafcly  computed  from  the  final  approximation. 


B.  Changing  the  discretization.  The  desired  grid  changes  are  first 
just  recorded  (e.g.,  incidentally  to  the  stage  of  computing  F ) and  only 
then  they  are  simultaneously  introduced,  taking  into  account  some  organ- 
izational and  stabilizational  considerations:  A change  (e.g.,  refinement) 

is  introduced  only  if  there  is  a point  where  the  change  is  "overdue" 

(e.g. , a point  where  Q > 10X) . Together  with  such  a point  the  change  is 
then  also  introduced  at  all  neighbor  (and  neighbor  of  neighbor,  etc.) 
pojly^s  where  the  change  is  "due"  (e.g.,  where  Q > 3X)  . The  changed  subgrid 
(G  in  the  above  example)  is  then  augmented  as  follows:  (i)  Around  each 

new  grid  point  we  add^^tra  points,  if  necessary,  so  that  the  grid  point 
(corresponding  to  a G point  where  a refjLri^ment  was  due)  becomes  an  inner 
point  (cf.  Sec.  7.2)  in  the  new  subgrid  (G  ).  (ii)  Holes  are  filled; 

that  is,  if,  on  any  grid  line,  a couple  of  points  are  missing  in  between 

grid  points,  these  missing  points  are  added  to  the  grid. 

The  control  work  in  this  system  is  negligible  compared  with,  say, 

k+1 

the  work  of  relaxjLijij  over^G  , because:  (i)  The  tests  are  maue  in 

transition  from  G*  to  G , which  takes  place  only  once  per  several  G 

relaxation  sweeps.  (ii)  Q is  computed  and  tested  only  at  points  of  the 

coarser  grid  G , and  at  each  such  point  the  work  is  smaller  than  the  relax- 
ation work  per  point.  (iii)  Changing  the  discretization  is  itself  inex- 
pensive since  it  is  done  by  extending  or  contracting  uniform  grids  (cf. 

Sec.  7.1),  the  main  work  being  in  interpolating  the  approximate  solution 
to  the  new  piece  of  uniform  subgrid. 

8.4  Generalizations.  In  some  problyys  i^2fs  not  enc(>S?h  tQ  adapt  h 

and  p.  Sometimes  different  increments  h , h . ...,  h should 

( i ) 

be  used  at  the  d different  directions,  and  each  h should  be  separately 
adapted.  Basically  the  same  procedures  ay . ybove  yay  be  used  to  test 
and  execute,  for  example,  a change  from  h ^ to  h ^ /2.  More  generally, 
one  would  like  to  adapt  the  local  coordinates  (cf.  Sec.  7.4) , e.g.,  near 
discontinunities . Automatic  procedures  for  such  adaptation  have  not 
been  so  far  developed,  but  are  conceivable. 
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Other  discretization  parameters,  such  as  the  centering  of  each 
term  in  the  difference  operator,  may  be  treated  adaptively.  (In  fact, 
such  adaptive  discretization  is  already  in  use  in  mixed-type  problems, 
where  it  was  introduced  by  Murman  to  obtain  stability.  See,  e.g.,  [9]). 
In  problems  with  unbounded  domains,  the  discrete  domain  may  be  determined 
adaptively  (with  increasingly  coarser  levels;  cf.  Sec.  7.1)  , using  a pro- 
cedure that  decides  to  extend  the  domain  if  the  previous  extension  was 
highly  profitable  in  terms  of  -AE/AW.  In  many  problems,  some  terms  jin 
the  difference  operator  can  altogether  be  discarded  on  most  levels  G . 

In  particular,  in  singularly  perturbed  problems,  the  highest  order 
terms  may  be  kept  only  on  the  finest-narrowest  levels.  Decision  can 
again  be  made  in  terms  of  -AE/AW,  in  an  obvious  way. 


9.  ADAPTIVE  DISCRETIZATION : CASE  STUDIES. 


To  get  a transparent  view  of  the  discretization  patterns  and  the 
accuracy-work  relations  typical  to  the  adaptive  procedures  proposed  above, 
we  consider  now  several  test  cases  which  are  simple  enough  to  be  analyzed 
in  closed  forms.  That  is,  we  consider  problems  with  known  solutions  and 
simple  behavior  of  the  local  truncation  errors,  and  we  calculate  the 
discretization  functions  h(x)  and  p(x)  that  would  be  selected  by  the  local 
op  mization  equations  (8.9),  and  the  resulting  relation  between  the  error 
estimator  E and  the  computational  work  W. 


9.1  Uniform-Scale  Problems.  A problem  is  said  to  have  the  uniform 
scale  n(x)  if  the  local  truncation  error  (8.2)  has  the  behavior 


(9.1) 


t (x,h,p)  ^ t(x) 


n(x) 


(p  <P1P,I 


Such  a behavior  occurs,  for  example,  when  the  solution  is  a trigonometric  or 
exponential  function  exp(0*x),  where  0 is  either  a constant  or  a slowly 
varying  function  (see  example  in  Sec.  9.2) . We  will  also  assume  for  sim- 
plicity that  (see  (8.4)) 


(9.2) 


w(p) 
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w p 
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Usually  8,=  1,  since  the  number  of  terms  in  the  difference  equations,  and 

hence  also  the  amount  of  computer  operations  at  each  grid  point,  are 

proportional  to  p.  1=2  is  appropriate  if  we  assume  that  we  have  to 

increase  the  precision  of  our  arithmetic  when  we  increase  p.  Rescaling 

W,  we  can  assume  that  w =1. 

o 

Using  (9.1-2)  in  equations  (8.9)  we  get 
8,-1  -d 

(9. 3a)  Gx  = Xd  p h , 

h 8.-1  -d  > 

(9.3b)  Gt  log  - + H p h = o . 

n < 

'V  . 

Hence,  denoting  by  p the  value  of  p that  satisfies 
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(9.4) 
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"adaptive" , 

i.e.,  dependent  of  A.  Where  p is  adaptive  (p^  £_  p = p £ P-^)  , h is  fixed  and 
each  "scale  cube"  is  divided  into  e*'  mesh  cells. 

Assume  now  further  that  the  computer  precision  is  unlimited  (which  is 
never  really  the  case,  but  may  provide  insight),  so  that  £.=  1 and  p.=°°.  If 
sufficiently  high  accuracy  is  desired,  then  A is  sufficiently  small  to  have 
p>p  , so  that  (9.5a)  applies.  By  (8.1)  and  (9.Ja)  this  implies 


(9.6)  E = Ade  J r\  dx  , 
and  hence,  by  (8.6), 


(9.7)  E = 0 e-W/(ed/n_ddX)  = C e-°  ^ , 

o o 

where  6 is  some  average  value  of  the  scale  n (x) . In  this  (idealized) 
case,  E decreases  exponentially  with  W.  For  realistic  W this  convergence 
rate  becomes  poor  when  8 is  very  small,  as  in  singularly  perturbed  problems. 
In  such  problems,  however,  for  realistic  W (9.5a)  no  longer  applies, 
and  another  rate  of  convergence,  independent  of  8,  takes  over  (see  Sec. 

9.3)  . 


9.2.  One-Dimensional  Case.  Consider  a 2-point  boundary -value  problem 

(9.8)  3.  SLH.  + = o , in  0<x<l , 

dx" 


with  constant  n>0  and  With  boundary  conditions  U(0)  and  U(l)  such  that 
the  solution  is  U = e_2x  . An  elliptic  (stable)  difference  approximation 
to  such  an  equation  can  be  central  for  n>h  but  should  be  properly  directed 
for  n<h.  (The  first  order  term  being  the  main  term,  the  second  order 
term  should  be  differenced  backward  relative  to  it  with  approximation 
order  p’  = p - [logn/loghj . See  (4)  and  Sec.  3.2  in  [3]).  In  either  case, 
the  truncation  error  is  approximately 
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(9.9)  t (x,h,p)  = t ( x 

We  now  choose  the  error  weighting  function  to  be 

(9.10)  G(x)  i 1, 


. . , . 1 -2x/n 

where  t (x)  = — e 

2n 


which  would  be  the  choice  (see  (8.3))  when  one  is  interested  in  accurate 
computation  of  boundary  first-order  derivatives  (corresponding,  e.g.,  to 
boundary  pressure  or  drag,  in  some  physical  models).  We  again  assume  no 
precision  limitations,  so  that  S.=l  and  p=“>.  We  take  p =2  since  second- 
order  is  no  more  expensive  than  first-order  approximations.  Inserting 
these  into  (9.5)  we  get 


(9.11a) 

(9.11b) 
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p = 2 , for  x < x < 1 , 
o — 


where 

(9.11c)  x0  = 2 (logif  " 3)  • 


If  x > 1,  then  (9.11a)  applies  throughout,  and  hence 
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and  the  condition  x > 1 itself  becomes,  by  (9.11c,  12), 
o — 
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Thus,  if  W satisfies  (9.14),  E converges  like  (9.13). 


9,3.  Singular  Perturbation:  Boundary  Layer  Resolution.  When  g is 

very  small,  problem  (9.8)  is  singularly  perturbed,  and  its  solution  has 
a boundary  layer  near  x=0.  The  above  mesh-size  h=n/e  is  too  small  to 
be  practical.  Indeed,  in  the  optimal  discretization  (9.11) , for  small  n we 
get  small  x , and  an  "external  region"  x < x < 1 is  formed  where  the 
mesh  size  grows  exponentially  from  n/e.  The  small  mesh  size  is  used 
only  to  resolve  the  boundary  layer.  In  this  simplified  problem  the  solution 
away  from  the  boundary  layer  (i.e. , for  x>>n)  is  practically  constant, 
so  that  indefinitely  large  h is  suitable.  Usually  h will  grow  exponentially, 
as  in  (9.11b),  from  h = — to  some  definite  value  suitable  for  the 
external  region.  In  the  transition  region  we  have  p=2 , i.e.,  the  minimal 
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order  of  differencing  is  used  in  the  reqion  where  h chanqes.  This  may 
be  useful  in  practical  implementations. 

From  (9.11)  and  (9.9)  we  qet  for  small  n 


(9.15) 
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where  the  integrals  axe  separately  calculated  .in  (o,X  ) 
E converges  exponentially  as  a function  of  W / instead 
rate  is  independent  of  n and  does  not  deterioriate  as  n 


(o,X  ) and  (X  ,1).  Thus, 
instead  of  W,  but  this 


9.4.  Singular  Perturbation  without  Boundary-Layer  Resolution.  To  see 
the  effect  of  choosing  different  error  weighting  functions,  consider 
again  the  above  problem  (Secs.  9.2,  9.3),  but  with  the  choice  G(x)  = x. 

This  choice  is  typical  to  cases  where  one  is  not  interested  in  calculating 
boundary  derivatives  of  the  solution  (see  (8.3)).  We  then  get 


(9.17) 
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Therefore,  for  small  n and  reasonable  X,  p < o and  p=2  for  all  x.  Hence, 
no  resolution  of  the  boundary  layer  is  formed.  Indeed,  by  (9.5b),  for 
very  small  n (singular -perturbation  case) 


(9.18) 
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so  that  h»n  . In  practical  situation  where  the  solution  in  the  external 
region  is  not  constant , the  actual  mesh-size  will  be  determined  by  the 
external  regime . 


9.5.  Boundary  Corners.  Consider  the  two-dimensional  Poisson  equation 
AU=F  with  smooth  F and  Ijiomoqeneous  boundary  conditions,  near  a boundary 
comer  with  angle  tt/a,  j <_  a < 1.  Denoting  by  r the  distance  from  the 
comer,  at  small  r the  solution  U is  0(r  ) , and  so  is  also  the  error 
weighting  function  G (if  accuracy  is  sought  in  the  solution^but  n°t 
in  its  derivatives  near  the  boundary) . Hence,  t = 0(h  r ) and 

9t /3h  = 0(hP~ra~P  ; . If  we  fix  the  order  of  approximation  p,  then  the 
optimal  mesh-spacing  derived  from  (8.9a)  is 


Hence,  by  (8.4)  and  (8.1)  the  total  work  and  total  error  contribution  from 
a region  of  radius  r around  the  corner  are,  respectively, 


w = /*£ 
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E =/G 


dxdy  = 0(A 
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E =J  G t dxdy  = 0(Ap/(p+2)  r2~2B)  . 

Hence  the  relation  E W P^  (the  usual  relation  in  d-dimensional  smooth 
problem  with  p-th  order  approximation)  still  holds  uniformly.  The  corner 
does  not  "contaminate"  the  global  convergence. 

• • If 

In  the  practical  grid  organization  (Sec.  8.3)  finer  levels  G with 

increasingly  smaller  mesh-sizes  h,  = 2 h will  be  introduced  near  the 

k k o 

corner.  By  (9.19),  the  level  G will  extend  from  the  corner  to  a dis- 
tance r^  = C A2ot  p 2 h^1/>B . Since  6<1,  for  small  h^  we  get  h^  > r^.  This 
gives  us  in  practice  a natural  stopping  valup  for  the  refinement  process: 

The  finest  mesh-size  near  the  comer  is  such  that  1l  ^ 4r  , so  that  level 

k k-1  K * 

G still  has  an  inner  point  belonging  to  G 

9,6.  Singularities.  Like  boundary  corners , all  kinds  of  other  pro- 
blem singularities,  when  treated  adaptively,  cause  no  degradation  of  the 
convergence  rate  (of  E as  function  of  W) . 

Consider  for  example  the  differential  equation  LU=F  where  F is 
smooth  except  for  a jump  discontinuity  at  x=0.  Whatever  the  approximation 
order  p,  the  system  will  find  -AE  (see  (8.11))  to  be  0(1)  at  all  points 
whose  difference  equation  include  values  on  both  sides  of  the  discontinuity. 
At  these  points  further  refinements  will,  therefore,  be  introduced  as  long 
as  -AE/AW  > 0(A).  Thus,  around  x=0,  some  fixed  number  (depending  only 
on  p)_of  mesh  points  will  be  introduced  at  each  level  Gk , until  a mesh 
size  h = 0(A  ) is  reached.  The  total  amount  of  added  work  is  there^ 

fore  proportional  to  the  number  of  levels  introduced,  which  is  0(log  h) . 

The  error  contribution  of  the  discontinuity  is  0(h  ) , which  is  exponentially 
small  in  terms  of  the  added  work. 

This  and  similar  analyses  show  that  the  adaptive  scheme  retains  its 
high-order  convergence  even  when  the  problem  is  only  piecewise  smooth, 
or  has  algebraic  singularities,  etc. 


10.  HISTORICAL  NOTES  AND  ACKNOWLEDGEMENTS. 

Coarse-grid  acceleration  techniques  were  recommended  and  used  by  several 
authors,  including  Southwell  [24,13,14],  Stiefel  [15],  Fedorenko  [5],  Ahamed 
[19],  Wachspress  [17],  de  la  Valde  Poissin  [16]  and  Settari  and  Aziz  [24]. 


Southwell  called  his  technique  "block"  and  more  generally  "group  relax- 
ation", described  it  as  "almost  essential  to  practical  success",  and  gave 
heuristic  explanation  as  well  as  practical  implementation  methods  based  on 
variational  considerations  ("the  aim  being  to  reduce  the  total  enerqy  by 
as  great  an  amount  as  possible") . He  also  depicted  procedures  of  "advance 
to  a finer  net"  [14] . Techniques  of  multiplicative  coarse-grid  corrections 
(special-cases  of  which  appeared  in  [14],  [19])  were  developed  by  Wachspress 
([17],  Chapter  9),  who  called  them  "variational  techniques".  This  work 
motivated  several  studies,  by  Froelich,  Wagner,  Nakamura  and  Reed  (see  a 
brief  survey  in  [18])  and  was  applied  in  nuclear  reactor  design  computations. 

All  these  were  two-level  methods.  The  nrulti-grid  idea  was  introduced 
by  Fedorenko  [6],  mainly  for  theoretical  purposes.  Namely,  he  rigorously 
proved  that  W(n,e),  the  number  of  operations  required  to  reduce  the  residuals, 
of  a Poisson  problem  on  a rectangular  grid  with  n points,  by  a factor  e,  is 
O(n|loge|).  Bakhvalov  [1]  generalized  this  result  to  any  second-order 
elliptic  operator  with  continuous  coefficients.  For  large  n,  this  is  the 
best  possible  result  - except  for  the  actual  value  of  the  coefficient.  The 
Fedorenko  estimate  can  be  written  as 

W(n,.01)  £ 210000n  + W(l<£  ,.01) , 

and  the  Bakhvalov  constants  are  still  much  larger.  For  admissible  values 
of  n these  estimates  are  therefore  far  worse  than  estimates  obtained  in 
other  methods,  and  they  did  not  encourage  any  development  of  the  method. 
Fedorenko  experimented  with  a two-level  algorithm  only,  and  seemed  to  imply 
that  for  practical  grid  sizes  ADI  may  be  more  efficient.  He  did  not  realize 
the  true  practical  potential,  in  both  efficiency  and  programming  simplific- 
ation, of  a full,  systematic  multi-grid  approach.  (It  can  be  proved  that 
W(n, . 01)  £ 106n,  and  in  practice  W(n,.01)  ^ 5Qn  is  obtainable.  See  App.  C) . 

The  first  full  multi-grid  algorithms  and  numerical  tests  were  described 
in  [2].  Our  original  approach  was  to  regard  the  finer  levels  as  "correcting" 
the  coarser  level  (cf.  Secs.  1,  7.2  and  7.5  above).  For  uniform  non-adaptive 
grids  this  approach  turns  out  to  be  equivalent  to  the  one  implied  by  [6] , 
but  fundamentally  it  is  different  and  more  powerful,  since  the  process  is 
not  confined  to  a fixed  discrete  system. 

A systematic  multi-grid  approach  for  a restricted  class  of  problems, 
with  somewhat  different  procedures  of  relaxation  and  transfer  to  coarser 
grids,  is  described  in  [21].  The  multi-grid  method  is  also  portrayed  in  [23]. 

Adaptive  discretization  procedures  were  introduced  by  several  authors. 

See  for  example  [10],  [20],  [21]  and  references  in  [21].  The  present  approach 

is  different,  not  only  in  its  multi-level  setting,  but  also  in  its  basic 
criteria  and  procedures. 
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APPENDIX  A.  INTERPOLATIONS  AND  STOPPING  CRITERIA:  ANALYSIS  AND  RULES. 

The  multi-grid  algorithms  described  above  (Secs.  4 and  5)  need  to  be 
supplemented  with  some  rules  of  interpolations  and  stopping  criteria. 

More  specifically,  for  the  interpolation  I k , transferring  weighted 
residuals  from  a fine  grid  G*  to  thg  next  coarser  grid  G , we  should  ^ 
prescribe  the  weights,  while  for  I , interpolating  corrections  from  G 
back  to  Gk,  the  method  and  order  of  interpolation  should  be  prescribed. 
Stopping  criteria  should  define  convergence  at  the  various  levels  and 
detect  slow  convergence  rates.  Numerical  tests  show  that  the  parameters 
to  be  used  are  very  robust:  Full  efficiency  of  the  multi-grid  algorithm 

is  obtained  for  stopping  parameters  that  do  not  depend  on  the  geometry 
and  the  mesh  size,  and  which  may  change  over  a wide  range  (see,  e.g.  , 
Appendix  B) , provided  the  correct  forms  of  the  stopping  criteria  are 
used,  and  some  basic  rules  of  interpolation  are  observed.  To  find 
the  correct  forms  and  rules,  and  to  determine  the  stopping  parameters, 
we  have  to  analyze  the  Coarse-Grid  Correction  (CGC)  cycle,  £hjch  consists 
of  interpolating  (I  ~)  the  residuals  to  the  coarser  g^id  G , where  the 
residual  problem  is  solved,  a^d  then  interpolating  (1^^)  that  solution 
back  as  a correction  to  the  G approximation. 
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We  can  use  a local  mode  analysis  (for  the  linearized,  coefficient- 
freezed  difference  equations),  similar  to  the  example  in  Sec.  3.  Such 
an  analysis  may  be  inaccurate  for  the  lowest  frequency  modes,  for  which 
the  interaction  with  the  boundary  is  significant.  But  these  lowest  inodes 
are  of  little  significance  in  our  considerations,  since  they  are  efficiently 
approximated  on  the  coarsest  grids  with  little  computational  work,  and 
since  care  will  be  taken  (i)  to  choose  interpolation  schemes  that  do  not  con- 
vert small  low- frequency  errors  into  large  high-frequency  errors;  and  (ii)  to 
stop  relaxation  sweeps  b fore  low-frequency  error  components  become  so  large 
that  they  significantly  feed  the  high  frequencies  (e.g.,  by  boundary  and 
non-linear  interactions).  In  fact,  we  will  see  that  the  dominant  components 
(i.e.,  the  components  that  are  slowest  to  converge  in  the  combined  process 
of  relaxation  and  coarse-g^id  corrections)  are  the  Fourier  components  e 
for  which  | 0 ] is  close  to  pir  , where  (in  a general  d-dimensiona)  problem) 

d 

(A.O)  Q = (0  ,0  , ...,0  ),  0 *X  ='  Z 9.x. , 1 0 1 = max  1 0 . | , 

d j=l  3 3 l<;<d  3 


These,  components  feed  on  each  other  in  the  interpolation  processes  between  G 
and  G , they  are  slower  to  converge  by  relaxation,  and  in  the  CGC  cycles 
they  may  even  diverge. 

To  simplify  the^djscussion  we  will  assume  that  the  mesh-size  ratio 
has  its  usual  value  p=—  , which  is  the  only  one  to  be  used  in  practice 
(cf . Sec.  6.2) . 


A.l.  Coarse  Grid  Amplif ication  Factors.  For  any  given  set  of  dif- 
ference  operators  and  a multi-grid  scheme,  a local  mode  analysis  of 
the  complete  MG  cycle  can  be  made  (cf . App.  C) , and  the  various  parameters 
can  be  optimized.  The  essential  information  can,  however,  be  obtained 
from  a much  simpler  analysis  that  treat  separately  thetwo  main  processes, 
relaxation  sweeps  and  CGC  cycles.  The  smoothing  rate  p (see  Sec.  3)  is  the 
main  quantity  describing  the  relaxation  sweeps.  The  CGC  local  mode  analysis 
is  summarized  below  (for  algebraic  details  see  Sec.  4 . 5 of  [3]) . 


In  the  CGC  analysis,  together  with  jsach  basic  Fourier  component  e 
(0  < | 0 | < y ) we  should  treat  all  the  G components  that  coincide  with 

it  on  G^  \ i.e.,  all  components  e^  X‘^1  (0  < 1 0 " | n ) such  that 
9'  = 0 . (mod  u)  for  j=l,2,...,d.  We  call  such  component  0'  a harmonic 
ol  0.  3We  are  especially  interested  in  those  harmonics  that  are  not 
separated  from  0 by  the  relaxation  sweeps,  e.g.,  the  set 


i 0 • x/h 


9 " = 9 (mod  ii ) 


g(0')  > u 0) ' 
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Denote  by  | T j the  number  of  members  in  this  set.  (Usually  | T | = 2°  , 

“ I I ^ TT  ^ 

where  a is  the  number  of  coordinates  j for  which  |0j|  ^ ) . In  terms 

of  the  0 Fourier  components  and  its  harmonics,  the  CGC  cycle  has  two 
effects : 

(i)  Assuming  the  components  not  in  to  be  comi  iratively  small  when 
the  CGC  cycle  is  entered,  the  set  of  components  in  Tg  s transformed  in 
the  cycle  by  a certain  matrix,  whose  spectral  radius  turns  out  to  be 


fao(0)  ' 

if 

-3 

CD 

II 

(A.  1 ) 

0(0)  = J 

1 max  (1,  o (6) ) , 
v*  o 

if 

It8i  > i. 

where 

-1 

(A. 2) 

o (0)  = |l  - l R(0,0') 
° 

V9') 

Bk-l<29) 

P(0')|  . 

The  functions  p(d'),  R(0,0 ')  and  B (0) 

o * 

are  the  "symbols 

„ , Tk-1  k 

of  1 k ' Xk-1 

and  L , 

respectively,  i.e. , 

k-1  iB'-x/h  i0*x 

I k e = P (0  ) e 


(A. 3) 


k i0*x/h 
Jk-1  6 


R(0,0')  e 


l i9'x/hJl 
L e 


0 " = 0 (mod  it) 
i 0 • x/h . 


= B.(0)  e 


(cf.  (A. 10)  below), 
i0'*x/h 

r 

U=  k,k-l)  . 


(If  L is  a system  of  equations,  and  the  right-hand  side  of  (A. 2)  is  there- 
fore a matrix,  then  a (0)  is  meant  to  be  the  spectral  radius  of  that 
matrix).  For  small  |8|  we  have  |T.|  = 1 and  hence 


(A. 4) 


0(0) 


o (0)  = i - p (o)  + o ( | 0 | p + lei1)  , 


where  p is  the  approximation  order  of  and  ^ (or  the  minimum  of  the 

two)  and  I is  the  order  of  the  I,  interpolation  (1=2  for  linear  inter- 
polation, etc.).  The  principal  CGc  amplification  factor  is 

(A. 5)  o = max  a (0) 

o<e4 

2 


= max  (l,o  ) , 

o 


where 


a 

o 


max 

°<0<- 


°o(9)  * 
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(ii)  The  CGC  cycles  also  generate  new  secondary  harmonics  'yT.  . 
The  rate  of  generating  these,  i.e.,  the  ratio  of  the  new  0"  amplitude  to 
the  old  amplitude  of  the  combined  harmonics,  turns  out  to  be 


(A. 6) 


o1(0") 


-1 


[ R(0 , 0" ) Bk(0")  FLk_1  (2©)  p(0") 


= 0 ( | 6 | I-In)  , 

where  in  is  the  order  of  the  differential  equations. 

It  follows  from  (A. 4, 6)  that  if  p (o)  = 1,  as  it  is  always  chosen 
to  be  (cf.  Sec.  A. 4) , and  if  I>m,  then  components  with  small  1 6 j are  very 
efficiently  reduced  in  the  multi-grid  process. 


A. 2.  The  Coarse-to-Fine  Interpolation  I 


\ 

) \ 


k-1 ' 


On  the  other  hand,  it 


follows  from  (A. 6)  that  if  I<m  then  even  a small  and  smooth  residual 
function  may  produce  large  high-frequency  residuals,  and  significant  amount 
of  computational  work  will  be  required  to  smooth  them  out.  This  effect 
was  clearly  shown  in  numerical  experiments  ([2],  [11]).  Hence  we  have 

The  Basic  Rule:  The  order  of  interpolation  should  be  no  less  than  the 

order  of  the  differential  equations.  (I>M.)  In  particular,  polynominal 
interpolation  should  be  made  with  polynomials  of  degree  >_  m-1. 

Higher  interpolation  orders  (I>M)  are  desired  in  the  initial  stages 
of  solving  a problem,  when  the  residuals  are  (locally)  smooth.  For 
instance,  in  regions  where  the  given  problem  has  smoothness  of  order  q 


(i.e. , F (x)  = T.  Ag  e 


ifi’x/h 


AQ  = 0 ( | 0 | qhq)) 


in  order  to  ensure  that 


the  high-frequency  residuals  remain  0(h"),  at  the  i-th  interpolation  from 
UM  ^ to  UM  the  order  should  be 

(A. 7)  I >_ m + max [q- (i-l)p,  o] . 

(In  jEact,  as  long  as  q > ip,  this  interpolation  need  not  be  followed 
by  G relaxation  sweeps,  since  the  low-frequency  amplitudes  are  still 
dominant.  Relaxation  would  only  feed  from  these  low  components  to  high 
frequency  ones,  causing  additional  work  later.  Still  fetter,  however, 
instead  of  this  multi-grid  mode  without  intermediate  G relaxation, 
is  to  make  a higher-order  correction  on  G ) . 

Eventually,  however,  the  smoothness  of  F (which  is  the  original  re- 
sidual function)  is  completely  lost  in  subsequent  Residuals  and  the  con- 
vergence of  conponents-  in  the  dominant  range  ( j 0 J 'v  — ) becomes  our  main 
concern.  For  these  components,  higher  interpolation  orders  (I>m)  is  no 
more  effective  than  the  minimal  order  (I^ra)  . This  again  was  exhibited 
in  numerical  experiments  ([2],  [10]),  which  confirmed  that  the  multi-grid 
efficiency  is  not  improved  (except  in  the  fq/pl  first  cycles)  by  using 
I > m. 
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An  efficient  method  to  implement  high-order  interpolations  in  case  of 
equations  of  the  form  A U = F is  to  base  the  interpolation  on  suitably- 
rotated  difference  approximations.  See  [14,  p.  53]  and  [7]. 

A. 3.  The  Effective  Smoothing  Rate.  The  smoothing  rate  p was  defined 
in  (3.8)  as  the  slowest  convergence  rate  for  all  components  not  represented 
at  the  coarser  level.  More  relevant,  however,  is  the  slowest  rate  among 
all  components  for  which  the  coarse-grid  correction  is  not  effective, 
namely , 


p = max  { p ( 9 ) 


oo(6)  1}  , 


which  we  call  the  "effective  smoothing  rate".  It  is  clear,  on  one  hand, 
that  go  rate  faster  than  p can  be  generally  obtained  as  rate  of ^convergence 
per  G”  relaxation  sweep,  no  matter  how  well  and  how  often  the  G problem 
is  solved.  On  the  other  hand,  the  rate  p can  actually  be  attained  (or 
approached)  by  correctly  balancing  the  number  of  relaxation  sweeps  in 
between  CGC  cycles  (see  Sec.  A. 6).  In  most  cases  (all  cases  examige^l  by 
us)  one  can  make  0^(9)  ^ 1 for  all  j 0 1 <_  j-  by  proper  choice  of  I ^ (see 
Sec.  A. 4) , and  it  is  therefore  justifiable  to  use  p as  the  effective 
rate  when  relaxation  schemes  are  studied  by  themselves. 

k-1 

A. 4.  The  Fine-to-Coarse  Weighting  of  Residuals  (I  ) , and  the 


Coarse-Grid  Operator  L 


The  transfer  of  the  G residuals 


r = f - L u to  the  coarser  grid  G , to  serve  there  as  the  right-hand 


(see  Sec.  4,  Step  e)  can  be  made  in  many  ways.  Generally 


f is  defined  as  some  weighted  average  of  the  residuals  in  neighboring  G 
points : 

, „k-l  , , k-1  k,  . _ k, 

(A. 9)  f (x)  = I ^ r (x)  = £ r (x+vh)  , 

where  v=  (v^ , , . . . , v^)  , v.  integers,  and  the  summation  is  over  a small 
set.  In  terms  of  these  weights,  p(9)  in  (A. 2)  is  given  by 


(A. 10) 


P (0)  = E Pv  e 


The  coarse  grid  operator  L can  alsg  be  chosen  in  many  ways,  e.g., 
as  some  weighted  average  of  the  operator  L in  neighboring  points . 

How  are  these  choices  to  be  made?  The  main  purpose  should  be  to  min- 
imize a,  but  without  investing  two  much  computatgogal  work  in  the  weighting. 
Usually,  it  is  preferable  to  adjust  p and  not  L , becauge^this  provides 
enough  control  on  a (cf.  (A. 2))  and  because  complicating  L adds  many 
more  computations  and  gets  increasingly  complicated  as  one  advances  to  still 
coarser  levels.  For  the  programmer,  using  the  same  operators  at  all  levels 
is.  an  important  simplification  (cf.  App.  B) , especially  for  non-linear  pro- 
blems . 
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It  is  clear  from  (A. 4)  that  we  should  take  p (o)  = Z p =1.  There  is 
no  apriori  restriction,  however,  on  the  siqns  of  the  weights  p . The 
trivial  weighting 


(A. 11) 


p =1 , p =0  for  v^O; 
o v 


P (0)  = 1, 


called  injection , has  an  important  advantage  in  saving  computations,  not 
only  because  the  weighting  itself  ig_|aved,  but  mainly  because  it  requires 
the  computation  of  £ only  at  the  G points,  while  other  weighting  schemes 
compute  r at  all  G points,  an  additional  work  comparable  to  one  G*  relax- 
ation sweep. 

Examples . For  symmetric  second-order  equations,  injection  should 
ugu^lly  be  used.  For  thg  5-point  Laplace  operator,  fgr^example,  if  we  take 


to  be  injection,  linear  interpolation  and  L 


also  a 5-point 


Laplace  operator,  we  get  o = a =1,  the  minimal  possible  value.  Any 
weighting  is  a pure  waste,  inc?uding  the  "optimal"  weighting 


(A. 12) 


P01  P0-l  P10  p- 


Pag  = ° for  | a|  + | 6 1 > 1 , 


which  minimized  oq,  giving  oq  = — , but  does  not  lower  a.  Numerical  tests 
(modifying  the  program  of  Appendix  B)  indeed  showed  no  improvement  by 
weighting.  If,  however,  the  equation  has  strong  variation,  making 
quite  different  from  B^,  we  may  get  for  injection  a = > 1 , while 

weighting  (A. 12)  will  keep  safely  below  1,  giving  a = 1. 

For  higher-order  equation^,  non-jtrjvial  weighting  offers  an  important 
advantage.  If,  for  example,  L and  L are  13-points  biharmonic  operators 
and  I cubic  interpolation,  then  o = 3 for  injection,  while  a = 1 

for  the  weighting 


P01  P0-l  p10 


p«6  = 0 


M + [e|  * i. 


A. 5.  Finite  Elements  Procedures.  The  main  difference  between  finite- 
element  and  finite-difference  multi-grid  procedures  is  in  the  interpolation 
schemes.  In  the  finite-element  case,  interpolation  procedures  follow  auto- 
matically from  tjie  variational  formulation  and  tjie  definition  ojf  the  approx- 
imation spgc^s  S (corresponding  to  the  levels  G ) . Usually,  S is  a sub- 
space of  S . The  coarse-to-fine  interpolation  is,  therefore,  simply  the 
identity  operation.  Also,  if  the  variationgl  problem  in  S is  to  minimize 
A^  (V  ),  then,  fgr^any  given  approximation  v , the  correction  problem  in  the 
coarser  space  S is,  simply,  to  minimize 


(8.13) 


(Vk  1)  = Ak(vk  + V*  1) 


Example . Consider  the  standard  example,  whejre  S is  the  space  of 
piecewize  linear  functions  on  the  triangulation  G and  A^  is  a Dirichlet 
intggral^who^e  minimization  is  equivalent  to  the  difference  equation 
A V = F , A being  the  5-point  Laplacian.  Computing  hy  (A,B)  , it 

K— 1 k— 1 k— 1 k k K 

turns  ojit^to  be  equivalent  to  the  equation  A V = I (F  -A  v ) , 
where  I , has  the  weights  {cf.  Sec.  A. 4). 

n _ 1 _ 1 
P00  4 ' P01  P11  P10  P0-l  P-1  -1  ~ P-10  " 8 ‘ 

These  weights  give  the  same  multi-grid  convergence  rate  as  injection  (and 
are,  therefore,  redundant). 


A. 6.  Criteria  for  Slow  Convergence  Rates.  (A)  Relaxation  sweeps , 


say  on  G , should  be  discontinued,  and  a switch  should  be  made  to  a coarse- 
grid  correction,  when  the  rate  of  convergence  becomes  slow;  e.g. , when 


(A. 14) 


residual  norm 


residual  norm  a sweep  earlier  — 


ri  = 


a3  + 3y 
o3  + 3 


The  norm  here  is  a suitable  (e.g.,  L_ , L or  (A. 18))  discrete  measure, 

2 00 

usually  of  the  "dynamic"  res^duals^  that  is,  residuals  computed  incidentally 
to  the  relaxation  process.  y and  a arg^efined  in  (A. 8)  and_(A.5),  respec- 
tively. _ Usually,  one  can  choose  the  I weighting  so  that  a=l,  in  which 
case  y-=y.  In  any  case,  (A.  14)  is  designed  to  ensure  that,  on  one  hand, 
the  CGC  cycle  is  delayed  enough  to  make  its  o magnification  small  compared 
with  the  intermediate  reduction  by  relaxation  sweeps.  On  the  other  hand, 
for  9 with  y(0)  considerably  slower  than  y,  the  CGC  cycles  are  still  suf- 
ficiently frequent  to  compensate  for  the  slower  y , since  their  reduction 
rate  a(6)  decreases  rapidly  ((A. 4)  with  p(0)=l).  the  stopping  rule  (A. 14) 
also  prevents  low  error  frequencies  from  dominating  relaxation,  thus 
avoiding  significant  feeding  from  low  to  high  frequencies  (through  boundary 
and  nonlinear  interactions) . 

If  the  "stopping  rate"  n varies  over  the  domain  of  computations  (as  a 
result  of  variations  in  L,  in  case  of  nonlinear  or  non-constant-coefficients 
problems),  the  largest  n should  be  chosen  for  the  stopping  criterion  (A. 14). 
If  log  n changes  too  much  over  the  domain  (which  should  not  happen  when 
a proper  relaxation  scheme  is  used),  then  (A. 14)  must  be  checked  separately 
in  subdomains , and  partial  sweeping  (see  Sec.  A. 9)  might  be  used. 

An  appropriate  value  of  n may  also  easily  be  found  by  direct  trial  and 
error.  Such  value  is  typical  to  the  (locally  linearized,  coefficient- 
freezed)  problem,  is  independent  of  either  h,  ft  or  F,  and  may  therefore 
be  found,  once  for  all,  on  a moderately  coarse  grid.  In  some  nonlinear 
problems  the  value  may  need  some  adjustment  as  the  computations  proceed. 
Whenever  the  coarse-grid  corrections  seem  to  be  ineffective,  n should  be 
increased,  e.g.,  to  (l+3q)/4.  Generally,  the  overall  multi-grid  con- 
vergence rate  is  not  much  sensitive  to  increasing  n:  At^^jst,  the  rate 

may  become  g instead  of  the  theoretically  best  rate  max  y (cf. 

Sec.  6.2). 
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For  the  Poisson  equation  with  Gauss-Seidel  relaxation,  for  example, 

- % - 

we  have  o=l,  y=y=  .5,  hence  rp.6^.,  ^he  example  in  Appendix  B shows  that 
the  optimal  MG  convergence  rate  y ^ .595  is  indeed  attained.  Experiment- 
ing with  this  program  gave  similar  results  for  any  smaller  n (the  reason 
being  that  the  minimal  number  of  two  sweeps  at  each  level  is  good  enough 
in  this  problem)  , while  for  any  n<_. 95  the  total  amount  of  computational 
work  was  no  more  them  twice  the  work  at  n= • 62 . 

(B)  Another  way  to  decide  upon  discontinuation  of  relaxation  is  to 
directly  measure  the  smoothness  of  the  residuals.  The  switch  to  coarser 
grids  can  be  made,  for  instance,  when  differences  between  residuals  at 
neighboring  points  are  small  compared  with  the  residuals  themselves. 


A. 7,  Convergence  Criteria  on  Coarser  Grids.  In  the  CgC^mode  analysis 
above  it  was  assumed  that  the  problem  on  the  coarser  grid  G was  fully 
solved  and  then  interpolated  as  a correction  to  the  G approximation . In 
the  actual  multi-grid  algorithm  (Sec.  4)  we  solve  the  G problem  iter- 
atively, stopping  the  iterations  when  some  convergence  criterion  is  met. 

This  criterion  should  roughly  detect  the  situation  at^which  more  improve- 
ment (per  unit  work)  is  obtained  by  relaxing  on  the  G grid  (after  inter- 
polating) then  by  further  iterating  the  G ' problem  (before  interpolating) . 
A crude  mode  analysis  (similar  to  Sec.  4.6.2  in  [3])  shows  that  such  a cri- 
terion is 


(A. 15) 
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< 6 
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where  d is  the  dimension,  o is  given  by  (A. 5), 
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T* 
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on  the  G 


R(6,e')  Bk (0 ' ) 


Vil!el 


-1 


k-1. 


p (6 ' ) 


is  any  norm  of  the 


current  residuals  in  the 
norm  in  the  G problem. 


grid  (cf . (A. 8) ) . | |r 

Gk  ^ problem,  while  | | rk | | is  the  corresponding 
It  is  important  that  these  norms  are  comparable: 


They^s^ould  be  discrete  approximations  to  the  same  continuum  norms.  Also, 
if  r ^_pre  t^ie  "dynamic"  residuals  (i.e.,  computed  incidentally  to  the 
last  G relaxation  sweep,  using^latest  available  values  of  the  relaxed 

solution)  then  r should  be  the  G*  dynamic  residuals,  unlike  the  residuals 
k-1  k-1 

transferred  to  G (to  define  f ; cf.  Sec.  A. 4)  which  must  be  "static" 
residuals  (i.e.,  computed  over  j:he  grijl  pithout  changing  the  solution  at 
the  same  time).  If,  however,  r'  and  r are  static  and  dynamic,  respectively, 
the  parameter  6 in  (A. 15)  should  be  multiplied  by  a certain  factor  £5  (see 
Sec.  4.6.2  in  [3] ) . 
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The  stopping  criterion  (A. 15)  is  based  on  the  assumption  that  error 
con^sonents  with  J 0 [ = — dominates  the  process.  In  the  first  Tq/pl  CGC 
cycles,  however,  lower  components  are  dominant,  and  the  mjiiy  consideration 
is  to  converge  them.  Hence,  at  that  initial  state,  the  G convergence 
criteria  should  be 


(A. 16) 


< ||Tk-1| 


. k-1  , k-1 

where  t are  the  G truncation  errors  (cf . Sec.  A. 8) . 


The  key  factor  6 can  also  be  found  by  trial  and  error.  Like  n above, 
it  is  essentially  independent  of  h,  J)  and  F,  and  may,  therefore,  be  found 
once  for  all  by  tests  on  moderately  coarse  grids.  Numerical  experiments 
show  that  the  overall  multi-grid  efficiency  is  not  much  sensitive  to  very 
large  variations  in  6 and,  in  particular,  6 may  be  lowered  by  orders  of 
magnitudes  without  large  changes  in  the  efficiency.  For  example: 

For  the  5-points  Poisson  equation  with  Gauss-Seidel  relaxation,  in- 
jection and  linear  interpolations,  (A. 15)  yields  6 = .219.  Numerical 
experiment  (e.g. , with  the  program  in  Appendix  B)  show  that  with  any 
.001  < p < .5  the  computational  work  is  no  more  than  25%  above  the  work 
with  p = .22,  and  no  more  them  100%  extra  work  for  any  .0001  < p < .7. 


M 

A. 8.  Convergence  on  the  Finest  Grid.  On  the  finest  grid  G the  so- 
lution is  usually  considered  converged  when  the  (static)  residuals  are  of 
the  order  of  the  truncation  error,  in  some  appropriate  norm.  One  way  to 
estimate  the  truncation  error  is  to  measure  them  on  coarser  grids  by  (5.7) , 
and  extrapolate  (taking  into  account  that  they  are  0{h^)).M  Another,  related 
but  more  straightforward  criterion  is  to  detectMwJen  the  G solution 
has  contributed  most  of  its  correction  to  the  G solut^oiji.  In  the 
FAS  algorithm  the  natural  place  to  check  is  when  a new  F is  computed, 
the  convergence  test  being 


(A. 17) 


I -M-l  -M-l  | 

F - F 

1 previous 1 


-M-l 


The  norm  here  may  be  any  (L  , L^,  etc.),  but  the  most  relevant  one 
is  the  discrete  version  of  the  norm  (cf.  Sec.  8.1) 

(A. 18)  | |f  | | *=  / G (x)  | f (x)  | dx 


^ A. 9.  Partial  Relaxation  Sweeps.  A partial  relaxation^sweep  over 
G is  a relaxation  sweep  that  may  skip  some  subdomains  of  G . (Unlike 
"selective"  relaxation  sweeps,  which  in  principle  pass  through  all  the 
grid  points,  although  corrections  may  not  be  introduced  in  some  of  them. 
Cf.  Sec.  3.2.  A partial  sweep  may  be  selective,  too.) 

Partial  sweeps  are  not  used  much  in  standard  relaxation  calculations . 
Usually,  a slow-to-converge  subdomain  is  coupled  to  other  subdomains 
and  therefore  cannot  he  relaxed  separately.  In  the  multi-grid  process, 
however,  only  high-frequency  error  components  are  to  be  reduced  by  relax- 
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ation,  and  this  can  be  done  separately  in  subdomains:  With  regard  to 

high-frequencies,  subdomains  are  practically  decoupled.  Hence,  in  the 
multi-grip  process,  partial  sweeps  are  potentially  very  important.  In 
fact,  high-frequency  amplitudes  may  vary  greatly  over  the  domain,  especially 
if  p and  o vary  much,  or  if  high-frequency  error  components  are  introduced 
at  boundaries,  making  partial  sweeping  there  very  desirable. 

Partial  sweeping  may  be  performed  by  applying  a criterion  for  slow 
convergence  (Sec.  A. 6)  separately  in  subdomains.  (If  the  connected 
reg^on^of  partial  relaxation  is  small,  n in  (A. 14)  should  be  changed  to 
(o  p+3p)/(o  +3),  where  p is  the  largest  amplification  factor  for  Fourier 
components  on  the  relaxed  region.)  A subdomain  may  be  excluded  from 
subsequent  relaxation  sweeps  if  slow  convergence  is  shown  simultaneously 
on  that  subdomain  and  on  all  neighboring  subdomains,  tinder  relaxation 
may  be  used  to  phase-out  the  relaxed  region  (cf.  [3],  Sec.  4.6.4).  The 
subdomains  may  be  chosen  quite  arbitrarily,  but  each  of  them  should  be 
large  enough  (at  least  4x4)  to  allow  for  separate  smoothing. 


A. 10.  Convergence  Criteria  on  Non-uniform  Grids 
k k-1 

When  G and  G are  not  coextensive  (i.e.,  the  domain 

k k-1 

covered  by  G is  only  part  of  the  G domain;  cf.  Sec.  7.2), 
the  convergence  criteria  (Secs.  A.7-8)  should  be  slightly  modified. 
First,  in  (A. 15),  | | rk  | | is  not  a comparable  norm,  since  it  may  be 
measured  on  a much  narrower  subdomain.  Instead,  one  can  use  the 


test 


(A. 19)  | |rk_1|  | < <5 1 | rk  1 1 |/n, 


k 1 k*“l 

where  | | r | [ is  the  residual  norm  computed  on  G at  the  first 
i k 

relaxation  sweep  after  switching  from  G . The  division  by  n in 
(A  19)  is  designed  to  compensate  for  the  fact  that  | | rk  1 | | is 
computed  a sweep  later  than  I | r | j . 

The  other  modification  is  in  (A. 17),  where  it  was  assumed  that 
G^  is  the  finest  level  everywhere.  Generally,  the  convergence 
test  can  be,  for  example, 


(A. 20) 
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where  the 

norms  are  taken  over 

^ (or,  more  precisely, 

over 

Gk+1 
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APPENDIX  B;  SAMPLE  MULTI-GRID  PROGRAM  AND  OUTPUT. 

This  simple  program  of  Cycle  C (written  in  1974  by  the  author  at  the 
Weizmann  Institute)  illustrates  multi-grid  programming  techniques  and 
exhibits  the  typical  behavior  of  the  solution  process.  For  a full 
description  of  Cycle  C,  see  Sec.  4 or  the  flowchart  in  Fig.  1. 


The  program  solves  a Dirichlet  problem  for  Poisson  equation  og  ^ 
rectangle.  The  same  5-point  operator  is  used  on  all  gridg.  The  I 
residuals  transfer  is  the  trivial  one  (injection)  , the  I^_^  interpolation 
is  linear.  The  higher  interpolation  (A. 7)  and  the  special  stopping 
criterion  (A. 16),  recommended  for  the  first  [q/p]  cycles,  are  not  implemented 
here . 


k k k 

For  each  gjrid  G we  store  botJi+^  and  f (k=l  ,2 , . . . ,M)  . For  handling 
these  arrays  f is  also  called  v . The  coarsest  grid  has  NXO  x NYO 
intervals  of  length  HO  each.  Subsequent  grids  are  defined  as  straight  re- 
finements, with  mesh  sizes  H(k)  = H0/2**(k-l).  The  function  F(x,y)  is  the 
right-hand  side  of  the  Poisson  equatiog.  The  function  G(x,y)  serves  both 
as^the  Dirichlet  boundary  condition  (4>  ) and  as  the  first  approximation 
(u  ) . The  program  cycles  until  the  L norm  of  the  residuals  on  G is  re- 
duced belc^  TOL,  unless  WORK  exceeds  WMAX.  After  each  relaxation  sweep  on 
any  grid  G , a line  is  printed  out  showing  the  level  k,  the  L^  norm  of  the 
("dynamic")  residuals  computed  in  course  of  this  relaxation,  and  WORK, 
which  is  the  accumulated  relaxation  work  (where  a sweep  on  the  finest 
grid  is  taken  as  the  work  unit) . 


Note  the  key  r^>le  of  the  GRDFN  and  KEY  subroutines.  The  first  is  used 
to  define  a grid  (v  ),  i.e.,  to  allocate  for  it  space  in  the  general  vector 
Q (where  IQ  points  to  Jihe  next  available  location) , and  to  store  its  para- 
meters. To  use  grid  v , CALL  KEY (k , IST,M ,N,H)  retrieves  the  grid  para- 
meters (dimension  MxN  and  mesh-size  H)  and  sets  the  array  IST(i)  so  that 
v. . = Q(IST(i)+j).  This  makes  it  easy  to  write  one  routine  for  all  grids 

v ; see  for  example,  Subroutine  PUTZ(k).  Or  to  write  the  same  routines 
(RELAX,  INTADD,  RESCAL)  for  all  levels. 


To  solve  on  the  same  domain  problems  other  than  Poisson,  the  only 
subroutines  to  be  changed  are  the  relaxation  routine  RELAX  and  the  re- 
sidual injection  routine  RESCAL,  the  latter  being  just  a slight  variation 
of  the  first. 

For  different  domains,  more  general  GRDFN  and  KEY  subroutines  should 
be  written.  A general  GRDFN  subroutine,  in  which  the  domain  characteristic 
function  is  one  of  the  parameters,  has  been  developed,  together  with  the 
corresponding  KEY  routine.  This  essentially  reduces  the  programming  of 
any  multi-grid  solution  to programming  a usual  relaxation  routine. 
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feogram  cycle  c 

EXTERNAL  G , F 

CALL  MULTIG  (3, 2 , 1 . , 6 , . 01 , 3 0 . ,G , F| 

STOP 

END 

FUNCTION  F (X,Y) 

F = SIN  (3.*  (X*  Y) ) 

RETURN 

END 


CYCLE  C 


Right-hand  side  of  the  equation 


FUNCTION  G ( X , Y)  Boundary  values  and  first  approximation 

G = COS  (2.*  (X  + Y)  ) 

RETURN 

END 

SUBROUTINE  MULTIG ( NXO , N Y0 , HO , M , TOL , U M AX, U 1 , F) 

EXTERNAL  U1,F 

DIMENSION  EPS  (10)  Multi-g’ :d  algorithm  (see  Fig.  1) 

DO  1 K = 1 , M 
K2=2**  (K - 1 ) 

CALL  GRDFN ( K , N X 0* K2+ 1 , N Y0 * K2 + 1 , HO /K2) 

CALL  GRDFN ( K + M , N X C* K2+ 1 , N YC * K2+ 1 , HC/K2) 

EPS  (M)  =TOL 
K =M 
WU  = 0 

CALL  P UTF  (M  , U 1 , 0) 

CALL  PUTF  (2*  M , F , 2) 

ERR= 1 . E30 
ERRP=EEB 

CALL  RELAX(K,K+M,ERR) 

WU=WU+4 . **  (K-M) 

WRITE  (6,4)  K , ERR  , WU 

FORMAT  ('  LEVEL' ,12, ' RESIDUAL  NORM= ', 1 PE  1 0 . 3 , ' »ORK  = • , C P F7 . 3) 

IF  (ERR.LT.EPS  (K)  ) GOTO  2 
IF  (WU.GE.WMAX) RETURN 

IF  (K.EQ.  I.OR.  ERR/ERRP.LT.  . 6)  GOTO  3 tv=.6 

CALL  RESCAL (K, K+ M, K+M-1) 

EPS  (K- 1)  =.3*ERR  6«=. 3 

K = K-  1 

CALL  PUTZ  (K) 

GOTO  5 

IF  (K.EQ.M) RETURN 
CALL  I NT ADD  (K, K+ 1) 

K =K  + 1 
GOTO  5 
END 


SUBROUTINE  GRDFN  (N,  IMAX,JM  AX  ,HH)  Define  an  IMAX  x JMAX 

COMMON/GRD/NST (20)  ,IMX(20)  ,JMX(20)  ,H  (20)  N 

DATA  IQ/1/  array  v . 

NST  (N)  =IQ 
1 M X (N)  =1  M A X 
JMX  (N)  =J  MAX 
H (N) =HH 

!Q=lQtIMAX* JMAX 

RETURN 

END 

SUBROUTINE  KEY  (K , 1ST, 1 MAX , JMAX, HH) 
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COM;iON/GRD/NST(20)  ,TMX  (20)  , JMX(20)  ,U  (20)  Set  1ST  such  that 

DIMENSION  I S T ( 1 ) 

IMAX=IMX  (K) 

J M A X = J M X (K) 

I S=N ST  (K) -JMAX- 1 
DO  1 1=1 ,IMAX 
IS=IS  + JMAX 
1 ISI(I)=IS 
HH  = H (K) 

RETURN 
END 

SUBROUTINE  PUTF(K,F,NH) 

COMMON  0 (18000)  ,IST  (600) 

CALL  KEY  (K  , I ST , II , J J , H) 

H2=H**NH 
DO  1 1=1,11 
DO  1 J=1,JJ 
X = (I-  1)  *H 
Y=  (J  - 1)  *H 

1 0 (1ST  (I)  + J)  = E (X,  Y)  *H2 
RETURN 
END 

SUBROUTINE  PUTZ  (K) 

COMMON  Q (18000)  , I ST  (200) 

CALL  KEY  (K, 1ST, II , JJ,H) 

DO  1 1=1,11 
DO  1 J=1,JJ 
1 Q (1ST  (I)  -U)  =0. 

RETURN 
FND 

SUBROUTINE  RELAX (K, KRHS,ERR) 

COMMON  Q (18000)  ,ISI(200)  ,ISHS(200)  A Gauss-Seidel  Relaxation  sweep 

CALL  KEY  (K, 1ST, II , JJ,H) 

CALL  KEY  (KRH  S,  IEH  S, II , JJ  , H)  on  the  equation 

11=11-1  . K KRHS 


= | | residuals | | 

IM  = 1ST (I- 1 ) : 

IP  = IST  (1+  1) 

DO  1 J=2 , J 1 

A=Q  (IR+J)  -Q  (IO+J+1)  -Q  (IO+J-1)  -Q  (IM+J)  -Q  ( I P+ J) 

ESR  = ERP.+  (At  h . *2  ( IO  + J)  ) * *2 
1 Q (IO  + J)  =- .25*A 
ERR=5QRT  (ERR)  /FI 
RETURN 
END 


J 1 = J J-  1 

ERR  =0 . 

DO  1 1=2,11 
I R = I P H S (I) 
I0  = IST  (I) 


V 


giving 

ERR 


vk(I,J)=  Q (IST(I)  + J) , 

and  set  IMAX  = IMX(K) 
JMAX  = JMX(K) 

HH  = H (K) 


K , . NH  K 
v *■  H(K)  • F 


K 

v *•  0 


SUBROUTINE  I N TA D D ( KC , KF) 

COMMON  0 (18000)  ,I5TC  (200)  ,TSTF(200) 
C A Lx.  KEY  (KC  , I STC  , IiC  , JJC  , F!C) 

CALL  KEY  (KF  , ISTF  , II F , J JF  , IFF) 

DO  1 IC=2, I IC 
1 1=2*  IC-  1 
.7  F = 1 


Linear  interpolation  and  additioi 

KF  KF  KF  KC 
v *■  v + I v 

KC 


Lr 
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\ 

*** 

» ; 


IFO  = ISTF  (IF) 

IFM=ISTF  (IF-  1) 

ICO  = ISTC  (IC) 

ICM  =ISTC  (IC-  1) 

DO  1 JC=2,JJC 
JF=JF+2 

A = . 5*  (Q  (ICO+  JC)  + Q (ICG  + JC-  1)  ) 

AM  = . 5*  (Q  (ICM  + JC)  + Q (ICM  + JC  - 1)  ) 

Q (IFO+JF)  = Q (IFO  + JF)  + Q (ICO+  JC) 

0(IFM+JF)  = Q (IFM  + JF)  + .5*  (Q  (ICO  + JC)  ♦ J (ICM+  JC)  ) 
Q (IFO+JF-  1)  =0  (IFO  + JF- 1)  + A 
1 Q (IFM+JF-1)  = Q (IFM+JF-1)  + .5*(A  + AM) 

RETURN 

END 


SUBROUTINE  RESCA L (KF , KRF , KRC) 

COMMON  Q (13  000)  , I UF  (2  00)  ,IRF  (200)  , IRC  (200) 
CALL  KEY  (KF , IUF ,IIF, JJF,HF) 

CALL  KEY  (KRF  , IR  F , IIF  , J JF  , HF) 

CALL  KEY  (KRC  , IRC  , IIC , J JC  , HC) 

IIC  1=IIC- 1 
JJC  1=JJC- 1 
DO  1 IC=2, IIC 1 
ICR  = IRC  (IC) 

IF  = 2* IC-  1 
J F = 1 

IFR  =IRF  (IF) 

IFO=IUF  (IF) 

IFM=IUF  (IF- 1 ) 

IFP  = IUF  ( IF+ 1 ) 

DO  1 JC=2, J JC1 
JF=JF  + 2 

S = Q (IFO+JF+  1)  + Q (IFO  + JF-1)  +Q  (If  M+  JF)  + Q (IFP  + JF) 
1 Q (ICR+JC) =4 . * (Q (IFR  + JF)  -S+4. *Q (IFO+JF)  ) 

RETURN 

END 


Residuals  injection 


KRC 


.coarse 


fine 


(v 


KRF 


- V 


KF. 


F 


LEVEL 

6 

RESIDUAL 

N 3 F M = 2.6141401 

WOF;K^ 

1.000 

LEVEL 

6 

RESIDUAL 

N3EM  = 2.764E+01 

WOR  K- 

2.000 

LEVEL 

5 

RESIDUAL 

N3RM  = 2 . 659E ♦ 0 1 

WOR  K- 

2.250 

LEVEL 

5 

RESIDUAL 

NO  EM  = 2.555E401 

WOR  K~ 

2.  500 

LEVEL 

4 

RESIDUAL 

N 3 R M = 2.317E+01 

WOF.K= 

2.  563 

LEVEL 

4 

RESIDUAL 

NORM  = 2.0S5E+01 

WOR  K= 

2.  625 

LEV  EL 

3 

RESIDUAL 

NO  R M = 1 . 649E+  0 1 

WORK- 

2.  64  1 

LEVEL 

3 

RESIDUAL 

NOR  M = 1.265E-r01 

WG  R K= 

2.  656 

LEVEL 

2 

RESIDUAL 

N 0 PM  = 7.626E*0C 

WORK= 

2.660 

LEVEL 

2 

RESIDUAL 

N 0 R M = 3 . 84  0E4  3 0 

WOE  K= 

2.  664 

LEVEL 

3 

RESIDUAL 

NO  R M = 5.058E400 

WGRK  = 

2.  680 

LEVEL 

4 

RESIDUAL 

N 3 F.  M = 8.006E+00 

WORK= 

2.  742 

LEVEL 

4 

RESIDUAL 

N 3 R M = 2.545E+00 

WORK= 

2.  805 

LEVEL 

RESIDUAL 

N 3 R M = 9.736E+00 

WOR  K= 

3.  055 

LEVEL 

c 

RESIDUAL 

N 3 F.  M = 2.464E+C0 

WORK= 

3.  305 

i.  £ V EL 

6 

RESIDUAL 

NORM  = 1.064E+01 

WOR  K= 

4.  305 

LE  V EL 

6 

RESIDUAL 

N3F.M=  2.442E+00 

WGRK= 

5.  305 

LEVEL 

6 

RESIDUAL 

N3RM=  2. 3Q9E+C0 

WOR  K= 

6.  305 

LEV  EE 

5 

RESIDUAL 

N 3 R M = 2.351E+00 

WOR  K= 

6.555 

L EV  EL 

5 

RESIDUAL 

N 0 R M = 2.3C3E+00 

WOR  K= 

6.  805 

LEVEL 

4 

RESIDUAL 

N 0 R M = 2 . 1 7 3 E 4 C 0 

WOR  K= 

6.867 

LEVEL 

4 

RESIDUAL 

NORM=  2.043E400 

WOR  K= 

o.  930 

LEVEL 

3 

RESIDUAL 

N3RM  = 1.739E400 

WORK= 

6.945 

I . 

LEVEL 

3 

RESIDUAL 

N 0 R M = 1 . 4 5 3 E 4 0 0 

WOR  K= 

6.  961 

LEVEL 

2 

RESIDUAL 

N3RM=  9. 869E-01 

WOR  K= 

6.965 

LEVEL 

2 

RESIDUAL 

N3RM  = 6 . 183E-01 

WOR  K= 

6.969 

LEVEL 

1 

RESIDUAL 

N3RM  = 2.760E-01 

WOP.  K= 

6. 970 

LEVEL 

1 

RESIDUAL 

NORM  = 5.170E-02 

WOR  K= 

6.971 

LEVEL 

2 

RESIDUAL 

N3RM=  2.2 92E-01 

WOR  K= 

6.975 

LEVEL 

3 

RESIDUAL 

NO  P M = 5.465E-01 

WOR  K= 

6.990 

4 

LEVEL 

4 

RESIDUAL 

N3RM=  7.71CE-01 

WOP  K= 

7.053 

LEVEL 

4 

RESIDUAL 

N0RM=  1 . 163E-01 

WOF.K= 

7.115 

LEVEL 

5 

RESIDUAL 

N 3 R M = 8 . 651 E- 01 

WoRK= 

7.365 

T 

LEV  EL 

5 

RESIDUAL 

N3EM  = 1.058E-C1 

WOR  K= 

7.  615 

LEVEL 

6 

RESIDUAL 

N 3 R M = 9.059E-01 

WORK= 

9.615 

LEVEL 

6 

RESIDUAL 

»3RM=  1.052E-01 

WOR  K= 

9.615 

LEVEL 

6 

RESIDUAL 

N3RM=  1.012E-01 

WORK= 

10.615 

E * 1 

Li  LiV  £ ji. 

5 

RESIDUA  L 

N3  RM  = 9.759E-02 

WOR  E= 

10. 865 

i - 

LEVEL 

5 

RESIDUAL 

M 3 R M = 9.452E-02 

WORK= 

11.115 

f '■ 

LEVEL 

4 

RESIDUAL 

N3RM=  8.710E-02 

WOR  K= 

11.178 

: \ 

LEVEL 

4 

RESIDUAL 

N3  R M = 7.960E-02 

WORK= 

11.240 

LEVEL 

3 

RESIDUAL 

N 0 R H = 6.389  -02 

WOR  K= 

11.2^6 

1 ; 

LEVEL 

3 

RESIDUA  L 

N 3 R M = 4 . 93  IE  - 02 

WOR  K= 

1 1. 271 

LEVEL 

2 

RESIDUAL 

NO  RM  = 2 . 916E-02 

WOR  K= 

1 1. 275 

LEVEL 

2 

RESIDUAL 

N3RM=  1.622E-02 

WORK= 

11.279 

1'  *4 

L EV  En 

2 

RESIDUAL 

N3RM=  1.017E-02 

WOR  K= 

11.283 

LEVEL 

3 

RESIDUAL 

N3RM=  1.949E-02 

WOR  K= 

11.299 

* 

LEVEL 

4 

RESIDUAL 

NO  E M = 3.128E-02 

WOP  K= 

11.361 

LEVEL 

4 

RESIDUAL 

N3RM=  8.843E-03 

WOR  K = 

1 1. 424 

1 t. 

LEVEL 

5 

RESIDUAL 

NORM=  3. 710E-02 

WOR  K= 

1 1.674 

LEVEL 

c 

RESIDUAL 

N 3 R M = 8.486E-03 

WORK= 

1 1. 924 

f 

LEVEL 

6 

RESIDUAL 

N3  R M = 4.0C7E-02 

WOR  K= 

12. 924 

V* 

A 

LEVEL 

6 

RESIDUAL 

N3RM  = 9.051E-03 

WOR  K= 

13. 924 

OUTPUT 


Error  reduction  by  a factor 
greater  than  10  per  cycle. 

Each  cycle  costs  4.3  wu 

Insensitivity :Results  would 
be  practically  the  same 
for  any  .005  <_  6 <_  .5 
or  any  0 < n < .65 
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APPENDIX  C.  RIGOROUS  BOUND  TO  MODEL-PROBLEM  CONVERGENCE  RATE. 


We  consider  the  model  problem:  5-jjjoints  Poisson  equation  A^U^=F 

on  a (n^+1)  x (n^+l)  rectangular  grid  G with  Dirichlet  boundary  conditions. 

Let  n.  = 2v  and  let  G^  be  the  ^^Sh+l)  x (w^N  +1)  uniform  grid  on  the  same 
3 3 i ^ 

domain,  with  mesh  size  h=2  h , (k=o,  1,  . . . , M) . We  will  estimate  the 

* ° M 

convergence  rate  and  work  in  one  multi-grid  cycle  C . 

M 

The  cycle  C is  Refined  inductively  as  fallows:  (i)  Make  r relax- 

ation sweeps  on  the  G approximate  solution  u . To  facilitate  the  rigorous 
Fourier  analysis  we  choose  as  our  relaxation  the  Weighted  Simultaneous  Dis- 
placement (WSD,  or  "weighted  Jacobi")  method  with  the  optimal  weights  01^^=48/41, 
o»Oi=W q_  . =u>..  _=u>  . -=8/ 4 ^ ^see  Sec-  3.3).  (ii)  Inject  (cf.  Sec.  A. 4)  the 

M — 1 M-l 

residual  problem  to  G . (iii)  Get  an  approximate  solution  v to  this 

M-l  M-l 

G problem  by  two  c cycles,  starting  from  the  zero  approximation. 

, . . M M _ M M-l  , M . 

(iv)  Correct  u •*-  u + I.„  , v , where  I„  . is  linear  interpolation. 

M-l  M-l 

It  is  easily  calculated  that  one  WSD  sweep  amplifies  the  Fourier  com- 
ponent expfiS’x/h^)  of  the  residual  by  the  factor 

U ( 0 ) = 1 - (2  - cos9^  - cosO^)  (24  + 8cos0^  + 8cos02)  / 41. 


M 

Denote  by  A(0)  the  amplitude,  before  the  C cycle,  of  t£je  0=  ( 9^. ' ®2 > coln- 
ponent  of  the  residual.  Acutally  present  on  the  grid  G are  only  components 
of  the  form  0 = {a^v/n^,a^/n^)  , (a_.=  + 1,  + 2,  ...,  +(n^-l)),  and  their 

amplitudes  AtS^,©^)  = -A(0  ,-6^)  = -A(-0^,02)  are  real  (assuming  two  of 

the  boundary  lines  to  lie  on  the  axes).  Since  p ( 0^ , 0^ ) = P (£0^  , + 02)  3s  rea-*-' 


the  r relaxation  sweeps  operate  separately  on  each  residual  mode,  trans- 
forming its  amplitude  A(0)  to  A’ (0)  = y(0)r  A(0) . 


For  any  component  0=(0^,02)  such  that  | 0 | = max  (leJJeJ)  £ 

e e1  = (e. , 0„) , e2  = (0,  + tt,0„),  e3  = (e,  ,e„  + ■*) , 9 = (i 


tt/2, 


denote  0 = ' 


(81+it,  02+tt)  , 


(01  + tt,02),  0 = (01,02  + tt)  , 

where  each  +_  sign  is  chosen  so  that  j 0 ^ | < it , (£=1,2 ,3,4)  . Of  these  four 

1 M-l 

"harmonics",  only  the  0 mode  appears  on  G , its  amplitude  there  (in  the 

M-l 

right-hand  side  of  the  G residual  problem  formed  in  Step  (ii))  being 


(C.l)  A0  = A' (01)  + A' (02)  + A' (B3)  + A' (04) . 

Let  denote  an  upper  bound  to  the  factors  by  which  any  C cycle  reduces 

the  L2  norm  of  the  residuals  on  G^.  In  particular,  the  two  CM  3 cycles 

(Step  (iii))  are  equivalent  to  solving  a GM  3 problem  with  amplitudes  a0 
instead  of  Afl , where 
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le,  li-  'V*-1’  * E»-4 


Z A.  . 

hi  <1  9 


M-l  M 

Hence,  interpolating  the  computed  correction  from  G to  G (Step  (iv) ) , 
the  new  residual  amplitudes  are  easily  calculated  to  be 

A(0*)  = A’  (0*)  - SO*)  a0 

= pO*)r  AO*)  - S(0*)A0  + SO*)  (A0-a0),  01=1,2,3,4,), 


where 


S ( 0 ) = 


(1  + cos0^) (1  + cosO^)  (4  - 2cos0^  - 20030^) 
4 - 2cos20^  - 2cos202 


(C.  3)  Z A (0*)  2 * < 2q2  Z A (0*)  2 + 2 Z S (0*)  2 (A.-aJ2  , 

X “ £ £ 69 

where  q is  any  upper  bound  to  the  spectral  radii  of  the  4*  matrices  Q ( 0 ) 
defined  by 


a«.  ,6>  ' (6t»-s<et))  "(e”>r  ■ 


(1  < i,m  < 4) 


Denoting  6 . = 1 - cos  0^  , it  is  easy  to  check  that 

1 j 1 ®i  + ^2  91  92  1 

(C . 4 ) Z S(©V  = j(l  + - R ■ R ) 1 J ■ 

l (61  + 62}  12 

Hence,  summing  (C.3)  over  the  relevant  range  of  0,  using  (C.2)  and  (C.4) 
and  then  (C.l),  we  obtain 


Z A ( 0 ) 2 < 2q2 


0 < Tf 


2 4 2 

Z AOr  + eM  , Z „ A 
i „ i M-l  p „ i v 0 

I © I i*  hi 


< (2q2  + Yew  /)  Z A ( 0 ) 2 , 

M_1  tel  < ir 


where  y is  any  upper  bound  to  all  Z y(0  ) , ( 0 < | 0 1 <_  ir/2)  . 

I 

Thus , we  have  obtained  the  bound 

2 2 4 

(C.3)  eM  = 2q  +YeM.1  • 

2 x 

A simple  computer  program  confirms  the  bounds  q = (7/41)  and 

Y = 1 + 3(7/41)  r.  Choose  r=3.  From  (C.5)  it  now  follows,  by  induction 

on  M,  that  r < .101  . 

M — 


M 

The  number  of  operations  in  the  C cycle  is  (12r+3)n^n^  + 2W^_^ 

Hence,  by  induction  on  M,  WM  <_^24r+6)  . We  thus  have  in  summary 

M 

Theorem.  The  above  C cycle  reduces  the  error  by  a factor  <_  .101  and 


costs  78  operations  (additions  and  multiplications)  per  (G  ) 


The  theorem  can  be  improved  (to  .1  reduction  in  only  53  operations - 
per-point) ^b^  defining  the  C cycle  to  consist  of  r+M  relaxation  sweeps  and 
only  one  C cycle,  and  choosing  large  r.  (Employing  arbitrarily  large  r 
pays  only  with  simultaneous-displacement  schemes  on  rectangular  domains, 
where  there  is  no  feed  from  low  to  high  frequencies) . 

In  practice,  .1  reduction  is  obtained  in  about  26  operations.  (See 
App.  B.  The  Gauss-Seidel  sweep  employe^  there  can  be  done  in^5^operatigns- 
per-point.  But  for  every  3 sweeps  on  G the  interpolations  I ^ and  I. 
are  also  performed,  each  costing  an  average  of  6/4  operations  per  point. 
Hence,  a work  unit  in  App.  B should  be  considered  as  representing 
(3x5+  3 ) /3  = 6 operations).  These  operation  involve  only  additions  and 
shifts . 


SINGULAR  VALUE  DECOMPOSITION: 
APPLICATIONS  AND  COMPUTATIONS* 

Gene  H.  Golub 
and 

Franklin  T.  Luk 

Stanford  University,  Stanford,  California 


ABSTRACT.  The  Singular  Value  Decomposition  (SVD)  of  a rec- 
tangular matrix  is  described.  Several  problems  arising  in  data 
analysis  are  given  and  their  solution  is  given  in  terms  of  the  SVD. 
Numerical  methods  are  discussed  for  computing  the  decomposition  for 
dense  and  sparse  matrices. 

1.  INTRODUCTION.  This  paper  is  concerned  with  the  singular 
value  decomposition  of  a given  matrix.  The  decomposition  is  very  use- 
ful although  it  may  not  be  as  familiar  as  some  of  the  other  matrix 
decompositions.  We  shall  describe  the  decomposition,  give  some 
specific  examples  of  its  applications,  and  suggest  some  methods  to 
compute  the  decomposition. 

There  are  many  matrix  decompositions  that  are  useful  in  mathe- 
matical applications.  A very  familiar  one  is  the  QR  decomposition  of 
a square  matrix  A: 

A = QR  , 

where  Q is  an  orthogonal  matrix  and  R is  an  upper  triangular  matrix. 

There  are  several  numerical  schemes  to  compute  this  decomposition.  We 

could  use  the  Gram-Schmidt  method;  the  columns  of  Q are  the  orthogonal 

columns  generated  by  the  process.  Another  way  to  generate  Q,  and  R 

is  through  the  use  of  Householder  transformations. 

Another  familiar  decomposition  is  the  reduction  of  a square 

matrix  to  its  Jordan  canonical  form: 

-1 

A = XJX  , 

where  X is  nonsingular  and  J is  a block  diagonal  matrix  in  which 
* 

This  work  was  in  part  supported  by  U.S.  Army  Research  Grant 

DAHCCA-  -7  5 -G-0185 • 
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This  decomposition  has  been  used  extensively  in  the  study  of  stability 
of  differential  equations.  Unfortunately,  there  does  not  appear  to  be 
any  good  numerical  algorithm  to  compute  the  decomposition  (Golub  and 
Wilkinson  [l^]). 

Finally,  we  shall  discuss  the  singular  value  decomposition  of 
an  m X n matrix  A: 

A = UEv'k  , 


» ; 


* 


Lr 


where  U is  an  m X m orthogonal  matrix,  V is  an  n X n orthogonal 
matrix,  and  E is  an  m X n matrix  with  non -negative  elements  down  the 
main  diagonal  and  zeros  everywhere  else.  For  our  discussion,  we  shall 
assume  that  A has  at  leant  as  many  rows  as  columns  so  that  m > n, 
although  this  is  not  always  the  case.  There  are  many  proofs  of  this 
decomposition,  for  instance,  in  the  book  by  Forsythe  and  Moler  [6]. 

A very  clear  and  useful  discussion  is  given  in  the  book  by  Lanczos: 
"Linear  Differential  Operators"  [17]. 


It  is  not  very  difficult  to  see  that  U consists  of  the 


eigenvectors  of  AA1",  V consists  of  the  eigenvectors  of  A "A  and 


the  diagonal  elements  ff. , 1 < i < n,  of  £ are  the  non-negative 


square  roots  of  the  eigenvalues  of  A A. 
arranged  in  such  a way  that 


We  assume  the 


a. ’ s are 
1 
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> 0 > 0 

— r 


where  r is  the  rank  of  the  matrix  A. 

The  singular  values  and  the  eigenvalues  of  a given  matrix  can 
frequently  differ.  Consider  an  m x m matrix 


The  matrix  A is  of  rank  m-1  but  .all  Us  eigenvalues  equal  0.  However, 
(m-l)  singular  values  of  A equal  1 and  only  one  singular  value  is 
zero.  Hence  the  number  of  non-zero  eigenvalues  of  a matrix  gives  a lower 
bound  on  its  rank,  whereas  the  number  of  non-zero  singular  values  of 
a matrix  is  its  rank. 


2 . APPLICATIONS . In  this  section  we  shall  discuss  some 
applications  of  the  singular  value  decomposition  (cf.  Golub  [3]). 

A.  Let  be  the  set  of  all  m * m orthogonal  matrices. 

We  wish  to  replace  a given  m x m matrix  A by  an  m * m orthogonal 
matrix  Q,  that  is  near  A.  In  order  to  study  the  nearness  of  one 
matrix  with  respect  to  another  matrix,  we  introduce  a norm;  we  use  the 
Frobenius  norm  of  a matrix , viz. , 

||A||  = ( E |a  |2)1/2  . 

We  shall  use  this  matrix  norm  throughout  this  discussion.  Our  problem 

then  consists  of  the  following:  let  A be  an  arbitrary  m * m matrix; 

determine  Q,  6 U such  that 
m 
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II a - q||  < ||a  - x|| 


for  any  X £ |^  . 


i'his  problem  is  important  in  factor  analysis  and  has  also  found  appli- 
•ations  in  aeronautics  (cf.  Bar-Itzhack  [l]!1. 

The  solution  to  the  problem  is  fairly  simple.  It  is  as  follows 
if 

A = UEV*, 


■ : en  we  replace  all  the  singular  values  by  1 and  write 

Q = UIV1  . 

rt  is  well-known  that  the  singular  values  of  an  orthogonal  matrix  all 
equal  1.  Now, 

|| A - Q||  = HuEV^  - UIV^H 

= |[£  - i jf  since  the  Frobenius  norm  is  unitarily 
invariant^  ^ 

= [ (a  -l)2  f (a  -l)2  +■  •••  + (a  -l)2}1^2  • 

12  n 

this  value  then  is  a measure  of  the  departure  from  orthogonality  of  a 
riven  matrix.  The  result  is  true  for  all  unitarily  invariant  norms 
(Fan  and  Hoffman  [5]). 

B.  We  consider  the  following  important  generalization  of 
problem  A.  Let  A be  an  m X n matrix  associated  with  a set  of  data 
and  let  B be  obtained  from  A through  a rotation  of  the  data.  The 
following  figure  may  represent  a typical  situation: 


A norm  is  said  to  be  unitarily  invariant  if  ||AU||  = ||VA||  = ||A|| 
where  U*U  = I and  V*V  = T. 
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Our  idea  is  to  replace  A “by  BQ,  that  is,  we  wish  to  replace  A by 

a rotation  of  B.  We  want  to  determine  Q,  € U such  that 

n 

||A  - BQ||  = min  . 

The  solution  is  again  given  in  terms  of  the  singular  value  decomposition. 
Green  [15]  and  Schonemann  [21]  showed  that  if 

B*A  = VZyt 

and  , 

Q = UV  , 


||A  - BQ!|  < || A - BX||  for  all  X € U . 


C.  Let  "nr  be  the  set  of  all  m X n matrices  of  rank  k. 

(r)  m'n  (k) 

Assume  A6t  . We  want  to  determine  B £ (k  < r)  such  that 

m,n  "<  m,n  — 


||A  - B||  < ||A  - X||  for  all  X € V 


In  other  words,  we  want  to  approximate  the  matrix  A with  a matrix  of 

lower  rank  and  we  want  the  best  approximation  for  the  fixed  rank.  The 

solution  is  given  in  terms  of  the  singular  value  decomposition. 

Let  A = U2Vt,  then  B = UQ,  Vt , where 

k 


m X n 


IIa  - b||  = llusv^  - unkvtH 

= ||s  - nj 

" (Vl  + •••  + an]  • 

Mirksy  [18]  showed  that  the  above  result  is  true  for  all 
unitarily  invariant  norms. 

Consider  the  following  example.  Let 


1 0 


0 10 


Mathematically,  the  matrix  is  of  rank  2.  But  the  following  rank  1 


matrix 


1 0 


0 0 


differs  from  A by  only  10  and  is  the  closest  matrix  of  rank  1 to  A. 


D.  The  singular  value  decomposition  also  enters  in  the  com- 
putation of  the  pseudo-inverse  of  a matrix.  An  n x m matrix  X is  a 
pseudo-inverse  of  an  m x n matrix  A if  it  satisfies  the  following 
four  relations: 


(i)  AXA  = A, 

(ii)  XAX  = X, 

(iii)  (AX)t  = AX, 

(iv)  (XA)1  = XA  , 


We  can  easily 


i 

•v. 

hi 

• vi 


[ *• 

; *• 

: H 

: H 


s 

«■  - 

I 

f 


The  pseudo-inverse  X is  unique  and  we  denote  it  by  A+. 
verify  that  given 


we  always  have 


where 


A = UEV  , 


A+  = VaU1  , 


Consider  the  following  problem.  Suppose  we  have  an  m-vector 
b and  an  m X n matrix  A.  We  would  like  to  determine  an  n- vector  x 
such  that 

||Ax  - b||2  = min.2 

If  A is  not  a matrix  of  full  rank,  we  do  not  have  a unique  solution 
to  the  problem.  Let 

x = {x|||Ax  - b||o  = min)  . 

We  would  like  to  determine  x £ X such  that  ||x||2  is  a minimum. 


27 


llxll2  = ( £yi)1//2  for  x = (y1fy2»***»  yn)*  • 

i=l 


583 


A 


The  solution  is  given  by  x = A b.  Hence  if  we  had  A , it  would  be 

A 

fairly  simple  to  compute  a sequence  of  solutions  {x.}  given  the 

sequence  of  data  [bj. 

J 

Unfortunately,  the  pseudo-inverse  of  a matrix  is  not  a continuous 
function  of  the  elements  of  A.  If  we  let 


A(e)  = 


1 0 


0 e 


where  e > 0,  then 


A+(e)  = 


1 0 


0 e 


A+(0)  = 


1 0 


0 0 


Hence  for  a small  positive  e,  we  see  that  A+(e)  is  quite  different 
from  A+(0).  Thus  the  computation  of  the  pseudo-inverse  is  quite  an 
ill-conditioned  problem. 

If  we  want  to  compute  the  pseudo-inverse  in  a stable  way,  we 
must  impose  some  additional  conditions.  We  shall  give  one  possibility 
which  seems  quite  satisfactory. 

Suppose  we  are  given  a matrix  A but  we  also  know  that  the 
matrix  is  really  some  matrix  B plus  some  perturbation  A,  viz. , 

A = B + A . 

We  do  not  know  B but  we  know  some  bound  on  the  error: 

I|a||  < q j 

for  example,  this  would  happen  if  the  elements  of  A were  empirical 
data  with  known  uncertainties.  We  wish  to  determine  B such  that 


l|A  - B||  < r,  , 


,4 

M 

t-  i 

f j 

V ' 
:-  *! 

*v» 


E.  We  may  use  the  singular  value  decomposition  to  solve 
homogeneous  equations.  Suppose  A is  an  m x n matrix  of  rank  r.  Let 

AV  = UZ  . 

We  partition  V into  an  n*  r matrix  and  an  n x (n-r)  matrix 

i.e. 

V . (Vi; v2)  , 

and 

a(vvv2)  = (UZ^O)  , 

where 


and  we  have  found  an  orthogonal  basis  for  the  null  space  of  A. 

Given  a set  of  eigenvalues  of  a square  matrix,  we  need  to 
solve  a set  of  homogeneous  equations  in  order  to  find  the  eigenvectors. 
Golub  and  Wilkinson  [ 14]  used  the  idea  to  compute  the  Jordan  canonical 
form  of  a matrix. 

Often,  we  wish  to  know  which  columns  of  a given  matrix  A are 
linearly  independent.  If  A is  a set  of  measurements  and  if  some 
columns  are  dependent,  we  may  want  to  determine  which  are  the  dependent 
columns,  eliminate  them  and  obtain  a linearly  independent  set  of  measure- 
ments. The  singular  value  decomposition  can  be  very  effective  for  this 
purpose. 

(n-l) 

Let  A £ w ~ ' and  let  the  last  column  of  A consist  of 
m,  n 

all  zeros.  We  find 


> 


from  which  we  see  we  should  eliminate  the  last  column  of  A. 

In  general,  we  want  to  take  V and  perform  Gaussian  elimina- 
tion  with  complete  pivoting  on  Vg  such  that 

PVgJT  = (^D)  , 

where 
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is  an  (n-r)  X (n-r)  upper  triangular  matrix 
□ is  an  (n-r)  X r matrix,  and 

II  is  an  n X n permutation  matrix. 


Then  if 


An  = (a1,a2)  , 


we  can  decide  that  the  columns  of  A^  form  a linearly  independent 
basis  for  the  columns  of  A.  This  and  other  problems  of  dependence 
are  discussed  extensively  in  a paper  by  Golub,  Klema  and  Stewart  [10], 

F.  Another  problem  is  the  following.  Consider 


It  is  not  difficult  to  see  that  the  maximal  value  of  the  normalized 
bilinear  form  is  cr^,  which  is  attained  when  £ = u^  and  £ = v^, 
where  is  the  largest  singular  value  of  A,  and  u^,  v^  are  the 

corresponding  left  and  right  singular  vectors,  respectively. 

Let  X be  an  m x s matrix  and  Y be  an  m x t matrix.  Consider 

I = Xu  and  ri  = Yv  . 

iv  <v  n*  rsj 

The  angle  0 between  £ and  £ is  given  by 


cos  0 = 


"2  iia»2 


We  can  choose  j;  and  £ to  maximize  the  normalized  inner  product. 

We  call  the  maximal  value  the  canonical  correlation  and  the  correspond- 

A 

ing  angle  (say  6)  the  angle  between  the  two  subspaces  U and  V. 

/v 

We  can  determine  0 very  easily  using  the  singular  value 
deomposition.  We  compute  the  QR  decomposition  of  X and  Y,  viz. 
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Then 


X = QR 


and 


Y = PS  . 


0 = cos-1(a  (Q^) ) . 
max 

The  computation  can  be  carried  out  even  when  X and  Y have  less  than 
full  rank  (Bjorck  and  Golub  [2]). 

G.  One  further  application  of  the  singular  value  decomposition 
is  in  computing  the  parameter  A in  ridge  regression  using  the  cross 
validation  technique  (Golub,  Wahba  and  Heath  [133). 

Given  an  m x n matrix  K of  rank  r and  an  m-vector  g.  We 
wish  to  minimize 

<P(£)  = lljg  - Kf|| 2 + Allfll®  • 

Using  the  variational  technique,  we  see  cp(f)  attains  its  minimum  at 

A A ^ 

f = f where  f satisfies 


(K1*  + AI)f  = K*g  . 


Hence  we  have  a ridge  regression  problem.  The  question  is  how  to 
choose  A.  One  possibility  is  to  try  to  estimate  A from  the  data; 
we  shall  describe  one  method  based  on  cross  validation.  We  shall 
see  how  the  singular  value  decomposition  of  K aids  ns  in  both 

A 

choosing  A and  solving  for  f for  the  chosen  value  of  A. 

Let  denote  the  (m-1)  X n matrix  obtained  by  leaving 

out  the  j-th  row  of  K,  and  let  g^'  denote  an  (m-1) -vector  obtained 
by  leaving  out  the  j-th  component  of  g,  viz. 
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,-ym i i.. 


and 


where  k.  is  the  j-th  row  of  K. 

I(j) 


Let  fvj;(A)  denote  the  solution  to 

(K^'^K^)  + AI)  f^(A)  = . 

^ fSj 

The  cross-validation  weighted  square  error  CV(A)  is  defined  by 

CT(M  • £ w [g.  -k*  f(1)(X)]2  , 

0==!  J J J 


where 


w.  > 0. 
0 


We  wish  to  choose  A such  that  CV(A)  is  a minimum.  We  see 
CV(A)  = Z w.[g.  - k^K^V^  + AI)"1  K^J)t  g^]2 


J=1 


m 


3 3 ~3 


- 2 ^ - Mi)'-1  (H*  - k^bg]*  , 


t\~l  ft l.  tx-,2 


3=  1 


3 3 ~3 


3T3 


where  e.  = (0, . . . ,0, 1,0, . . . ,0)^. 

3 ' v 

J 

We  apply  the  Sherman-Morrison  formula  to  obtain 

(K^K  + AI  - k^)"1  = (^K  + AI)'1  + aT1(KtK  + AI)"^ .k^^K  + AI)' 
~o~j  3 ~j~a 


589 


q>,(?0  = -o-2 — , 

J crl  + A 

J 


and  cr.'s  are  the  singular  values  of  K.  Finally, 

r n2 


CV(  A)  = E w, 


gi  - ui/3(A,g;i 



1 - £ tu  cp  (A) 
j=l  J 0 


which  is  very  easy  to  evaluate. 

For  a chosen  value  of  A,  we  may  solve  the  ridge  regression 
problem  easily  using  the  singular  value  decomposition  of  K.  We  have 


(K*K  + Al)f  = K^g  , 


which  reduces  to 


Hence 


V^Z  + Al^f  = VZ^ 


f = V^Z  + AI)"1Ztc 


where 


r o-.g . 


2 + -x  ~0 


d=l  a.  + A 


v = Z2>  • • • » ~n^  • 

Many  numerical  experiments  have  been  carried  out  in  [13], 


COMPUTING  THE  SINGULAR  VALUE  DECOMPOSITION  OF  A DENSE  MATRIX 


! • 


Our  basic  tool  is  the  Householder  transformation.  Consider  a matrix 


of  the  form 


p(1>  . i 


2u(i)ud)t 


where 


,(D| 


.=  1. 


R] 


Note  that  the  matrix 
denote  the  original  matrix. 


is  symmetric  and  orthogonal 

>(D 


Let  A 


(1) 


We  construct  P 


to  annihilate  all 


elements  below  the  diagonal  in  the  first  column  of  A 


(1) 


F i 


°L 

Q * 

12 

a' 

In 

0 

&22  • ‘ * 

a' 

2n 

0 

a32 

a! 

3n 

am2 


. A<3/2> 


mn 


»(D 


on  the  right  of 


We  next  apply  a Householder  transformation  Q 

A(5/2),  and  our  idea  is  to  eliminate  all  elements  to  the  right  of  the 
(1,2)  position  in  the  first  row  of  A^^2^  without  disturbing  the  zero 
elements  in  the  first  columns 


* 1 

v . 


it 

n 

! i •. 

g 


P 


P1 

0 

0 \ 

a22 

a" 

23 

a2n  > 

\ 

S2 

a33  * * • 

Sn 

a", 

m3 

c,/ 

/ 

Our  process  continues  with 
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0 


^(k+1/2)  = p(k)A(k) 


where  the  effect  of  Pv  ' is  to  eliminate  all  elements  below  the 

(k) 

diagonal  in  the  k-th  column  of  A , and  with 

A(k+1)  = A(k+l/2)Q(k) 

(k) 

where  the  effect  of  Q'  is  to  eliminate  all  elements  to  the  right 
of  the  (k,k+-l)  position  in  the  k-th  row  of 

The  end  result  is  that  we  have  n transformations  on  the  left 
((n-1)  transformations  if  m = n),  and  (n-2)  transformations  on  the 
right  of  A : 


j = p^  ...  P^AQ^  •••  Q^n"2^ 


°i  ei 


a2  P2 


o 


o 


o 


We  now  apply  the  QR  method  due  to  Francis  [ 7]  and  Kublanovskaya  [ 16] 
(Golub  and  Kahan  [9])  so  that 

J = XZYt  , 


i.e.,  the  singular  value  decomposition  of  J.  If  we  write 


= pW  . . . p(n) 


P = P 


and  Q = Qv 


,(n-2) 


then 
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where 


A 


U = PX,  V = QY  . 

The  first  program  to  do  the  above  computations  was  given  by 
Golub  and  Reinsch  [12].  A version  for  complex  matrices  was  given  by 
Businger  and  Golub  [3]  A program  for  real  matrices  is  available  in 
Release  2 of  EISPACK  [4]. 


4.  COMPUTING  THE  SINGULAR  VALUE  DECOMPOSITION  OF  LARGE 
SPARSE  MATRICES.  We  have  several  possibilities  for  computing  the 
singular  value  decomposition  of  a large  and  sparse  matrix.  In  most 
problems,  we  want  only  the  few  greatest  singular  values  of  a large 
matrix;  for  instance,  in  image  reconstruction,  the  order  of  the  matrix 
frequently  exceeds  10,000  but  only  veiy  few,  generally  less  than  100, 
of  the  greatest  singular  values  are  of  physical  significance. 


A.  Standard  Lanczos  algorithm.  The  best  available  algorithm 
for  computing  a few  of  the  greatest  singular  values  of  a large  sparse 
matrix,  say  A,  is  the  Lanczos  algorithm.  The  algorithm  uses  the  matrix 
A only  in  the  computation  of  the  matrix-vector  product  Ax  or  A x 
given  a vector  x.  Hence  we  can  use  the  sparsity  of  A to  compute 
the  products  very  efficiently.  Unlike  other  methods  that  transform 
the  matrix,  the  Lanczos  algorithm  preserves  the  matrix' s sparse 
structure  and  works  well  even  if  the  matrix  is  so  large  that  it  has  to 
be  stored  on  some  auxiliary  device  (e.g.  magnetic  disk  or  tape). 

We  use  the  Lanczos  algorithm  to  bi diagonalize  a given  m X n 


matrix  A; 


where 


A = PJQ' 


ptP  = I , 
m 


QtQ  = I 


) 


“l  B1 


We  can  expand  the  two  resultant  equations: 

AQ  = PJ  and  P*A  = JQt  , 

in  terms  of  the  columns  of  P and  of  Q to  yield 


a3i  = ai£i  ' 

A%+1  “ + ai+l£i+l  * 

£iA  = ai3i  + h£+i> 
£nA  = Vn  * 


l — f 


So  our  algorithm  is 

(1)  Choose  ^ such  that  ll^llg  = 1. 
Set 


“l  = llwi||2, 
£i  = \\  • 


P! 


r 


v ? 


! 

h 

* i 

n 


(2)  For  i = 1, 2, . ,.,s-l  (2  < s < n),  compute 


S±  = A JBi  - ai3i  » 


<9i+l  = :'i  ~i  ’ 


*i+i  = A«9i+i  - pi£i  > 


ai+l  “ "~i+l"2  * 


£i+1  = a— w 


-1 
i+l~i+l 


For  some  s < n,  we  denote 


r(s) 


,(s) 


- (£]_>  £2’  ’ £s)  * 


and 


,(s) 


Q = (si*  £2’  •••  ’ ss)  • 


We  now  apply  the  QR  method  on  J ' ' so  that 


(s) 


The  cr|S^,  P^x|S\  and  are  usually  accurate  approximations 

to  the  largest  singular  value,  and  the  corresponding  left  and  right 
singular  vectors,  respectively,  of  A.  We  may  apply  the  Kaniel-Paige 
theory  [19]  to  show  that  if  9 > 0 is  the  angle  between  and  v_. 


2 (s) 

CT1  " el  < CT1  < l ^ 


where 


2 

el 


2cr^  tan  9 
s-ivi  - rJ 


U 

ts 


i 


and 


is  the  (s-l)-st  Chebyshev  polynomial  of  the  first  kind, 


We  construct  an  example  to  show  how 


generally  approxi- 


mates cr,  well  even  for  a small  s.  Let  cr,  = 1.0,  = 0.9,  s = 20 

and  9 = cos  0.1.  Then 


and 


tan2  9 = 1 ~ Pg1  = 99  , 

0.1 


r = 2(ii)0,9  = °*°5  ' 


rH fc  ^ 


T19(1.105)=  2.8  x 105  . 


Hence 


and 


el  = 2 ' 1 ' l9 2 ~ 2*5  x 10"5 

1 (2.8  x 10 ^)2 


0'1  - 0.000025  < o-p0)  < a1  . 


Since  n is  usually  very  large,  we  often  choose  some  s « n 

subject  to  storage  availability.  If  our  convergence  criterion  for  the 

(s)  ( s ) 

singular  value  is  not  satisfied,  we  may  use  Q as  the  new 

initial  vector  and  restart  the  Lanczos  algorithm.  Since  the  accuracy 
of  our  approximation  is  bounded  by  tan  0,  where  9 is  the  angle 
between  our  initial  vector  and  v^,  we  expect  to  obtain  better  approxi- 
mations if  we  iterate  the  Lanczos  algorithm.  If  z.  (or  w. ) = 0 
for  some  i < s,  we  could  continue  the  algorithm  by  choosing  some  z^ 
(or  w. ) orthogonal  to  all  the  previous  z.’s  (or  w.’s),  j < i.  We 

could  also  choose  to  terminate  the  algorithm  because  z.  (or  w. ) = 0 

~i  ~i  ~ 

usually  means  some  singular  values  have  converged. 

The  sequences  of  vectors  and  {c[^}  form  orthogonal  sets 

in  exact  arithmetic.  Hence  theoretically,  we  need  only  to  keep  the 
most  recent  pairs  of  jd s and  c^'s  in  memory,  providing  great 
savings  in  storage.  Unfortunately,  the  sequences  {jd^}  and 
generally  lose  orthogonality  very  quickly  due  to  cancellation  errors 
in  the  computations  of  the  z^'s  and  w^'s.  A remedy  is  to  reorthogo- 
nalize  the  most  recently  computed  (or  with  respect  to  all 
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the  previous  jg.'s  (or  cj.'s),  j < i.  But  this  task  is  expensive  in 
both  execution  time  and  storage,  because  we  must  now  store  all  the 
computed  (jx)  and  in  memory.  Paige  [ 19]  argues  against  the 

necessity  of  reorthogonalization,  but  the  matter  is  still  a subject  of 
controversy. 


B.  Block  Lanczos  algorithm.  In  many  cases  we  may  save  work 
if  we  iterate  with  a block  of  vectors  instead  of  a single  vector.  The 
saving  could  be  considerable  if  we  were  computing  a multiple  singular 
value.  In  general,  if  we  had  some  a priori  knowledge  of  the  singular 
value  spectrum,  we  could  choose  an  appropriate  block  size  with  good 
gains.  Computer  experiments  (Golub,  Luk  and  Overton  [ll])  show  that 
if  we  want  several  of  the  largest  singular  values,  we  often  gain  by 
choosing  a block  size  p > 1.  Also,  if  the  matrix  is  stored  on  an 
auxiliary  device,  we  may  make  some  gains  in  efficiency  if  we  multiply 
the  matrix  into  several  vectors  simultaneously. 

In  a similar  way  to  the  standard  Lanczos  algorithm,  our  block 
version  reduces  the  matrix  A to  a block  bidiagonal  form.  We  start 
with  an  arbitrary  n X p matrix  Q^,  and  perform  a QR  factorization  of 


the  product  AQ^: 


P1A1  AS.  ’ 


r 


* 


where  is  an  m x p matrix  such  that  P^P^  - 1 > 

and  A^  is  a p x p upper  triangular  matrix. 

Our  algorithm  continues  with 


P.A.  = AQ.  - P.  B.  . , 
11  i l-l  l-l 


where  QJ3^  ^ and  PiAi  are  ^ factorizations  of  the  respective 
right-hand  sides,  and 


is  an  n x p matrix  such  that  = I , 

P.^  is  an  m x p matrix  such  that  P^P^  = I , 


and  both  Bi_1  and  are  p x p upper  triangular  matrices. 

We  have  tacitly  assumed  p x s < n.  We  consider  the  ps  x ps 
block  tridiagonal  matrix  Jv  '• 


which  is  also  banded  upper  triangular  with  bandwidth  = p+1. 

(s) 

We  can  reduce  <T  to  bidiagonal  form  using  the  Householder 

transformations.  We  can  also  use  plane  rotations  to  reduce  to 

bidiagonal  form  to  take  advantage  of  the  sparse  banded  structure  of 

( s ) 

J . A plane  rotation  in  the  (i,  j) -plane  is  an  orthogonal  matrix 
of  the  form 


2 2 

where  y + cr  =1.  It  is  easy  to  verify  that  given  a vector  we 
can  choose  y and  a such  that  P.  . annihilates  the  j-th  component 


of  x. 


cse  y and  c such  that  P,  . annihilates  the  j-th  component 

iJ  (s) 
We  give  a simple  example  to  demonstrate  how  we  can  reduce  J ' 


using  plane  rotations. 

Suppose  we  have  the  following  6x6  matrix 


x x SI 


O 


,o 


XXX 


We  construct  a plane  rotation  Q postmultiplying  A to  annihilate 
the  (1,3)  element.  The  rotation  creates  a non-zero  element  in  the  (3,2) 
position,  i.e.. 


o 


XXX 


D 


XXX 


Now  we  apply  a plane  rotation  P^y  premultiplying  AQ^  to  eliminate 
the  (3,2)  element.  A new  non-zero  element  appears  in  the  (2,5)  position: 


P ,AQ  , 
23  23 


x x x B 


O 


O 


We  construct  to  annihilate  the  new  nonzero  (2,5)  element  from  the 

right: 


601 


X X 


An  appropriate  plane  rotation  from  the  left  will  annihilate  the 

newly  created  (5,4)  element  without  creating  new  nonzero  elementst 


We  say  we  have  "chased"  away  the  (1,3)  element  of  A 


(cf.  Rutishauser 


[20]). 


We  may  determine  the  singular  value  decomposition  of  the 

resultant  bidiagonal  matrix  using  the  QR  method.  Using  a theorem  due 

to  Underwood  [22],  we  can  show  that  the  p largest  singular  values 

( s ) 

of  J are  usually  accurate  approximations  to  the  p largest 
singular  values  of  A.  In  fact,  if  a Q >0  is  the  smallest  singular 
value  of  where  consists  of  the  first  p columns  of  V, 

then  for  k = 1,2, ...,p. 


where 


2 (s) 

ak  * ek  ^ ak  £ °k  ' 


2 

£k 


(crl  + ak) 


tar.2  9 

2 1 + rk 

s-ri  - rJ 


j 
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-1 

9 = cos  a 


min  ’ 


r = \—£+1 
k ak  + a1  » 


and  T , is  the  (s-l)-st  Chebyshev  polynomial  of  the  first  kind. 
s-1 


the 


We  consider  an  example  similar  to  the  one  we  have  given  in 

previous  section.  Let  or  = 1.0,  a = 0.9,  cr,  = 0.5,  P = 2,  s = 10 
-1  ° 


and  6 = cos  0.1.  Then 


tan  9 = 99, 


°1  " °3  0.5 


'i  - srnr = 27o  - ^ 


°p  “ o 4 

v - d - u * H i 0 21 

r2  " <x0  + o,  1.9  ' ' 


1 + rl  _ 1.25 

1 - r1=  0.75 


= 1.67  , 


1 + y, 


2 - 1.21  • 


i - r2  0.79 


= 1.55  , 


Hence 


T (1.67)  = ITT  , 
T9(1.53)  = 5.7  x 105 


2 ^ 2 x 99  t 2.0  x 10-6 

1 10° 


and 


e2  " 1,?~  = l-4  X 10"5  • 

2 (5.7  X Kr  )2 


Comparing  the  two  examples,  we  can  see  how  a proper  choice 
of  the  block  size  would  save  us  work  with  the  same  limitation  on 
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storage  space.  In  general,  a good  choice  of  p depends  on  the  singular 
value  spectrum,  the  number  of  singular  values  desired,  and  the  avail- 

A 

ability  of  memory.  If  there  is  a cluster  of  p largest  singular 

A 

values,  it  usually  pays  to  choose  p = p.  Often,  the  knowledge  is  not 

available  and  a satisfactory  rule  appears  to  be  choosing  p equal  to 

the  number  of  singular  values  we  want  to  compute.  Our  tests  [ 11J 

show  that  the  reorthogonalization  of  each  recently  computed  P (Q^) 

with  respect  to  all  the  previous  P.'s  (Q.'s),  j < i,  is  necessary  for 

J J 

accurate  results.  We  therefore  must  keep  all  the  P.'s  and  Q.'s 

11 

in  memory,  effectively  bounding  the  value  p X s. 

An  algorithm  will  soon  be  published  [ 11] . 
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